GLIAL CELL LINE-DERIVED NEUROTROPHIC FACTOR RECEPTOR



1. Field of the Invention


The present invention relates to receptors for glial cell line-derived neurotrophic 
factor (GDNF) and provides nucleic acid and amino acid sequences encoding GDNF 
receptor (GDNFR). The present invention also relates to therapeutic techniques for the 
treatment of GDNF-responsive conditions. 


2. Background of the Invention

Glial Cell line-Derived Neurotrophic Factor 

Glial cell line-derived neurotrophic factor (GDNF) was initially isolated and
cloned from rat B49 cells as a potent neurotrophic factor that enhances survival of
midbrain dopaminergic neurons (Lin et al., Science, 260, 1130-1132, 1993). Recent
studies have indicated that this molecule exhibits a variety of other biological activities,
having effects on several types of neurons from both the central and peripheral
nervous systems. In the central nervous system (CNS), GDNF has been shown to
prevent the axotomy-induced death of mammalian facial and spinal cord motor neurons
(Li et al., Proceedings Of The National Academy Of Sciences, U.S.A., 92, 9771-9775,
1995; Oppenheim et al., Nature, 373, 344-346, 1995; Yan et al., Nature, 373,
341-344, 1995; Henderson et al., Science, 266, 1062-1064, 1994; Zurn et al.,
Neuroreport, 6, 113-118, 1994), and to rescue developing avian motor neurons from
natural programmed cell death (Oppenheim et al., 1995 supra). Local administration
of GDNF has been shown to protect nigral dopaminergic neurons from
axotomy-induced (Kearns and Gash, Brain Research, 672, 104-111, 1995; Beck et al.,
Nature, 373, 339-341, 1995) or neurotoxin-induced degeneration (Sauer et al.,
Proceedings Of The National Academy Of Sciences U.S.A., 92, 8935-8939, 1995;
Tomac et al., Nature, 373, 335-339, 1995). In addition, local administration of GDNF
has been shown to induce sprouting from dopaminergic neurons, increase levels of
dopamine, noradrenaline, and serotonin, and improve motor behavior (Tomac et al.,
1995 supra).

More recently, GDNF has been reported to be a potential trophic factor for
brain noradrenergic neurons and Purkinje cells. Grafting of fibroblasts ectopically
expressing GDNF prevented 6-hydroxydopamine-induced degeneration and promoted
the phenotype of adult noradrenergic neurons in vivo (Arenas et al., Neuron, 15,
1465-1473, 1995), while exogenously applied GDNF effectively promoted survival
and morphological differentiation of embryonic Purkinje cells in vitro (Mount et al.,
Proceedings Of The National Academy Of Sciences U.S.A., 92, 9092-9096, 1995).
In the peripheral nervous system, GDNF has been shown to promote the survival of
neurons in nodose, ciliary, and sympathetic ganglia, as well as small populations of
embryonic sensory neurons in dorsal root ganglia (DRG) and trigeminal ganglia
(Trupp et al., Journal Of Cell Biology, 130, 137-148, 1995; Ebendal et al., Journal Of
Neuroscience Research, 40, 276-284, 1995; Oppenheim et al., 1995 supra; Yan et al.,
1995 supra; Henderson et al., 1994 supra). GDNF has also been reported to enhance
the expression of vasoactive intestinal peptide and preprotachykinin-A mRNA in
cultured superior cervical ganglion (SCG) neurons, and thus affects the phenotype of
SCG neurons and induces bundle-like sprouting (Trupp et al., 1995 supra).

Expression of GDNF has been observed in a number of different cell types and
structures of the nervous system. In the CNS, GDNF mRNA expression has been
observed by reverse transcriptase polymerase chain reaction (RT-PCR) in both
developing and adult rat striatum, the major target of nigral dopaminergic innervation,
and widely in other regions, including hippocampus, cortex, thalamus, septum,
cerebellum, spinal cord, and medulla oblongata (Arenas et al., 1995 supra; Poulsen et
al., Neuron, 13, 1245-1252, 1994; Springer et al., Experimental Neurology, 127,
167-170, 1994; Stroemberg et al., Experimental Neurology, 124, 401-412, 1993;
Schaar et al., Experimental Neurology, 124, 368-371, 1993). In humans, GDNF
transcripts have also been detected in striatum, with the highest level in the caudate and
lower levels in the putamen. Detectable levels are also found in hippocampus, cortex,
and spinal cord, but not in cerebellum (Schaar et al., Experimental Neurology, 130,
387-393, 1994; Springer et al., 1994 supra). In the periphery, GDNF mRNA
expression has been reported in DRG and SCG of postnatal day 1 rats, sciatic nerve,
and primary cultures of neonatal Schwann cells (Trupp et al., 1995 supra; Hoffer et
al., Neuroscience Letters, 182, 107-111, 1994; Henderson et al., 1994 supra;
Springer et al., 1994 supra). In addition, recent studies have shown that GDNF
transcripts are also widely expressed in peripheral non-neuronal organs, including
postnatal testis and kidney, embryonic whisker pad, stomach, and skin. Expression
can be detected at lower levels in embryonic muscle, adrenal gland and limb bud, and
in postnatal lung, liver and ovary (Trupp et al., 1995 supra; Henderson et al., 1994
supra). So far, however, the biological significance of the non-neuronal expression of
GDNF is not clear.

Detailed descriptions of the preparation and characterization of GDNF protein
products may be found in U.S. Patent Application No. 08/182,183 filed May 23,
1994 and its parent applications (also see PCT/US92/07888, WO 93/06116 filed
September 17, 1992 and European Patent Application No. 92921022.7, Publication
No. EP 610 254), the disclosures of which are hereby incorporated by reference.
Additional GDNF protein products are described in pending U.S. Patent Application
No. 08/535,681 filed September 28, 1995, the disclosure of which is hereby
incorporated by reference. As used herein, the term "GDNF protein product" includes
biologically active synthetic or recombinant GDNF proteins and analogs, as well as
chemically modified derivatives thereof. GDNF analogs include deletion variants such
as truncated GDNF proteins, as well as insertion and substitution variants of GDNF.
Also included are GDNF proteins that are substantially homologous to the human
GDNF protein.

GDNF Therapy 

GDNF therapy is helpful in the treatment of nerve damage caused by
conditions that compromise the survival and/or proper function of one or more types
of nerve cells. Such nerve damage may occur from a wide variety of different causes.
Nerve damage may occur to one or more types of nerve cells by: (1) physical injury,
which causes the degeneration of the axonal processes and/or nerve cell bodies near
the site of injury; (2) temporary or permanent cessation of blood flow to parts of the
nervous system, as in stroke; (3) intentional or accidental exposure to neurotoxins,
such as chemotherapeutic agents (e.g., cisplatinum) for the treatment of cancer or
dideoxycytidine (ddC) for the treatment of AIDS; (4) chronic metabolic diseases, such
as diabetes or renal dysfunction; or (5) neurodegenerative diseases such as
Parkinson's disease, Alzheimer's disease, and amyotrophic lateral sclerosis (ALS),
which result from the degeneration of specific neuronal populations.

Several studies indicate that GDNF therapy is particularly helpful in the
treatment of neurodegenerative conditions such as the degeneration of the
dopaminergic neurons of the substantia nigra in Parkinson's disease. The only current
treatments for Parkinson's disease are palliative, aiming at increasing dopamine levels
in the striatum. The expected impact of GDNF therapy is not simply to produce an
increase in the dopaminergic neurotransmission at the dopaminergic nerve terminals in
the striatum (which will result in a relief of the symptoms), but also to slow down, or
even stop, the progression of the degenerative processes and to repair the damaged
nigrostriatal pathway and restore its function. GDNF may also be used in treating
other forms of damage to or improper function of dopaminergic nerve cells in human
patients. Such damage or malfunction may occur in schizophrenia and other forms of
psychosis. The only current treatments for such conditions are symptomatic and
require drugs which act upon dopamine receptors or dopamine uptake sites, consistent
with the view that the improper functioning of the dopaminergic neurons which
innervate these receptor-bearing neuronal populations may be involved in the disease
process.






Receptors

A number of receptors which mediate binding and response to protein factors
have been characterized and molecularly cloned, including receptors for insulin,
platelet derived growth factor, epidermal growth factor and its relatives, the fibroblast
growth factors, various interleukins, hematopoietic growth factors and ciliary
neurotrophic factor (U.S. 5,426,177). Study results indicate that some receptors can
bind to multiple (related) growth factors, while in other cases the same factor can bind
and activate multiple (related) receptors (e.g., Lupu et al., Science, 249:1552-1555,
1990; Dionne et al., EMBO J., 9:2685-2692, 1990; Miki et al., Science, 251:72-75,
1991). Most receptors can broadly be characterized as having an extracellular portion
or domain responsible for specifically binding a protein factor, a transmembrane
domain which spans the cell membrane, and an intracellular domain that is often
involved in initiating signal transduction upon binding of the protein factor to the
receptor's extracellular portion. Although many receptors consist of a single
polypeptide chain, other receptors apparently require two or more separate subunits in
order to bind to their protein factor with high affinity and to allow functional response
following binding (e.g., Hempstead et al., Science, 243:373-375, 1989; Hibi et al.,
Cell, 63:1149-1157, 1990).

The extracellular and intracellular portions of a given receptor may share
common structural motifs with the corresponding regions of other receptors,
suggesting evolutionary and functional relationships between different receptors.
These relationships can often be quite distant and may simply reflect the repeated use
of certain general domain structures. For example, a variety of different receptors that
bind unrelated factors make use of "immunoglobulin" domains in their extracellular
portions, while other receptors utilize "cytokine receptor" domains in their
factor-binding regions (e.g., Akira et al., The FASEB J., 4:2860-2867, 1990). A large
number of receptors with distinct extracellular binding domains (which thus bind
different factors) contain related cytoplasmic domains encoding tyrosine-specific
protein kinases that are activated in response to factor binding (e.g., Ullrich and
Schlessinger, Cell, 61:203-212, 1990). The mechanisms by which factor binding
activates the signal transduction process are poorly understood, even in the case of
receptor tyrosine kinases. For other receptors, in which the intracellular domain
encodes a domain of unknown function or in which the binding component associates
with a second protein of unknown function (e.g., Hibi et al., Cell, 63:1149-1157,
1990), activation of signal transduction is not well characterized.

The mode of action of GDNF in vivo is not clearly elucidated in the art, in part
due to the absence of information on a receptor for GDNF. Two groups have
independently found that striatum-injected [125I]-labeled GDNF can be retrogradely
transported by dopaminergic neurons in the substantia nigra (Tomac et al.,
Proceedings Of The National Academy Of Sciences Of The United States Of America,
92, 8274-8278, 1995; Yan et al., 1995 supra). Retrograde transport of [125I]-GDNF
by spinal cord motor neurons, DRG sensory neurons and neurons in the B layer of
retina ganglia has also been observed. These retrograde transport phenomena can all
be specifically inhibited by 100-fold or higher concentrations of unlabeled GDNF,
suggesting a saturable, receptor-mediated transport process. In vitro, recombinant
GDNF has been shown to enhance the survival and promote dopamine uptake of
cultured dopaminergic neurons at very low concentrations. The observed
half-maximal effective concentration (EC50) of GDNF on these neurons is 0.2 to 1.6 pM
(Lin et al., 1993 supra). GDNF has also been shown to support the survival of
dissociated motor neurons at low concentrations. The reported EC50 of GDNF on
motor neurons, in a 5 to 10 fM range, is even lower than that on dopaminergic
neurons (Henderson et al., 1994 supra).
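
The inference that inhibition by excess unlabeled GDNF reflects a saturable,
receptor-mediated process can be illustrated with a simple one-site equilibrium
binding model. The following is only a sketch under stated assumptions: labeled
and unlabeled GDNF are assumed to share a single dissociation constant, ligand
depletion is ignored, and the numerical values are hypothetical placeholders
rather than measurements from the cited studies. The function name
labeled_occupancy is introduced here purely for illustration.

```python
def labeled_occupancy(labeled, unlabeled, kd):
    """Fraction of receptor sites occupied by labeled ligand at equilibrium.

    One-site competitive binding in which labeled and unlabeled ligand are
    assumed to have the same dissociation constant kd. All three arguments
    must be expressed in the same concentration unit.
    """
    if kd <= 0 or labeled < 0 or unlabeled < 0:
        raise ValueError("kd must be positive and concentrations non-negative")
    return labeled / (kd + labeled + unlabeled)


# Hypothetical values in picomolar, chosen only to make the arithmetic clear.
KD_PM = 1.0
LABELED_PM = 1.0

# Labeled ligand alone, at a concentration equal to kd.
alone = labeled_occupancy(LABELED_PM, 0.0, KD_PM)

# Same labeled ligand in the presence of a 100-fold excess of unlabeled ligand.
competed = labeled_occupancy(LABELED_PM, 100 * LABELED_PM, KD_PM)

# Share of the original labeled signal that survives the competition.
remaining = competed / alone

print(f"occupancy alone:     {alone:.4f}")
print(f"occupancy competed:  {competed:.4f}")
print(f"signal remaining:    {remaining:.2%}")
```

In this model a finite number of sites is shared by both ligands, so a large
excess of unlabeled ligand displaces nearly all labeled ligand. Non-specific
uptake that is not limited by a fixed number of sites would not be competed away
in this manner.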

Taken together, these observations indicate that receptor(s) for GDNF
expressed in these cells have very high ligand binding affinities. Similar to members
of the TGF-β family, the widely diversified tissue distribution and varied biological
function of GDNF on different populations of cells suggest that different types of
receptor(s) for GDNF or receptor complexes may exist. Saturation steady-state and
competitive binding of [125I]-GDNF to E10 chick sympathetic neurons has shown
that these neurons express GDNF binding sites differing from those observed in
dopaminergic and motor neurons. The half-maximal saturation concentration and the
half-maximal inhibition concentration of GDNF on these binding sites are in the range
of 1 to 5 nM (Trupp et al., 1995 supra). Similarly, the EC50 of GDNF in supporting
the survival of sympathetic neurons from P1 rat SCG has also been reported to be in
the nanomolar range (Trupp et al., 1995 supra).

To better understand the mechanism by which GDNF activates signal
transduction to exert its effects on cells, it would be beneficial to identify the
receptor(s) which mediate binding and response to this protein factor. It would also be
beneficial for GDNF therapy to identify and make possible the production of accessory
molecules which provide for or enhance GDNF signal transduction. Moreover, the
identification of a protein receptor for GDNF would provide powerful applications in
diagnostic uses, for example, as an aid in determining if individuals would benefit
from GDNF protein therapy. Furthermore, the protein receptor for GDNF could be a
key component in an assay for identifying additional molecules which bind to the
receptor and result in desired biological activity.

WO 97/40152
PCT/US97/06281


SUMMARY OF THE INVENTION 

The present invention provides nucleic acid sequences which encode a
neurotrophic factor receptor protein having an amino acid sequence as depicted in
Figures 2 and 4 (SEQ ID NOs. 2 and 4) as well as biologically equivalent analogs.
The neurotrophic factor receptor protein and protein products of the present invention
are designated herein as glial cell line-derived neurotrophic factor receptor (GDNFR)
protein and protein products. The novel GDNFRs are functionally characterized by
the ability to bind GDNF specifically and with high affinity, and to act as part of a
molecular complex which mediates or enhances the signal transduction effects of
GDNF. GDNFR protein products are typically provided as a soluble receptor protein
and in a substantially purified form.

In one aspect, the present invention provides for the production of GDNFR
protein products by recombinant genetic engineering techniques. In an alternative
embodiment, the GDNFR proteins are synthesized by chemical techniques, or a
combination of the recombinant and chemical techniques.

In another aspect of the present invention, the GDNFR proteins may be made
in glycosylated or non-glycosylated forms. Derivatives of GDNFR protein typically
involve attaching the GDNFR protein to a water soluble polymer. For example, the
GDNFR protein may be conjugated to one or more polyethylene glycol molecules to
decrease the precipitation of the GDNFR protein product in an aqueous environment.

Yet another aspect of the present invention includes the various polynucleotides
encoding GDNFR proteins. These nucleic acid sequences are used in the expression
of GDNFR in a eukaryotic or prokaryotic host cell, wherein the expression product or
a derivative thereof is characterized by the ability to bind to GDNF and thereby form a
complex capable of mediating GDNF activity, such as increasing dopamine uptake by
dopaminergic cells. The polynucleotides may also be used in cell therapy or gene
therapy applications. Suitable nucleic acid sequences include those specifically
depicted in the Figures as well as degenerate sequences, naturally occurring allelic
variations and modified sequences based on the present invention.

Exemplary nucleic acid sequences include sequences encoding a neurotrophic
factor receptor protein comprising an amino acid sequence as depicted in Figures 2
and 4 (SEQ ID NOs. 2 and 4) capable of complexing with glial cell line-derived
neurotrophic factor (GDNF) and mediating cell response to GDNF, and biologically
equivalent analogs thereof. Such sequences include: (a) a sequence set forth in Figure
1 (SEQ ID NO. 1) comprising nucleotides encoding Met1 through Ser465 or Figure 3
(SEQ ID NO. 3) comprising nucleotides encoding Met1 through Ser468 encoding a
neurotrophic factor receptor (GDNFR) capable of complexing with glial cell
line-derived neurotrophic factor (GDNF) and mediating cell response to GDNF; (b) a
nucleic acid sequence which (1) hybridizes to a complementary sequence of (a) and (2)
encodes an amino acid sequence with GDNFR activity; and (c) a nucleic acid sequence
which (1) but for the degeneracy of the genetic code would hybridize to a complementary
sequence of (a) and (2) encodes an amino acid sequence with GDNFR activity. Also
disclosed herein are vectors containing such nucleic acid sequences, wherein the
sequences typically are operatively linked to one or more operational elements capable
of effecting the amplification or expression of the nucleic acid sequence. Host cells
containing such vectors are also contemplated. Typically, the host cell is selected from
mammalian cells and bacterial cells, such as a COS-7 cell or E. coli, respectively.
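
The reference to the degeneracy of the genetic code in item (c) reflects the fact
that most amino acids are specified by more than one codon, so that different
nucleic acid sequences can encode the same amino acid sequence. The following
sketch illustrates this with the standard genetic code. The two nine-nucleotide
sequences are hypothetical examples chosen for illustration and are not taken
from SEQ ID NO. 1 or SEQ ID NO. 3; the function name translate is likewise an
assumption of this sketch.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G order for each
# codon position, with the third position varying fastest. "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence into one-letter amino acid codes."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Two hypothetical sequences that differ at several nucleotide positions.
seq_a = "ATGTCTCTG"
seq_b = "ATGAGCTTA"

peptide_a = translate(seq_a)
peptide_b = translate(seq_b)

print(seq_a, "->", peptide_a)
print(seq_b, "->", peptide_b)
```

Both sequences translate to the same three-residue peptide even though they
differ in nucleotide sequence, which is why two sequences encoding the same
protein may diverge enough that they would not hybridize to one another.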

A further aspect of the present invention involves vectors containing the
polynucleotides encoding GDNFR proteins operatively linked to amplification and/or
expression control sequences. Both prokaryotic and eukaryotic host cells may be
stably transformed or transfected with such vectors to express GDNFR proteins. The
present invention further includes the recombinant production of a GDNFR protein
wherein such transformed or transfected host cells are grown in a suitable nutrient
medium, and the GDNFR expressed by the cells is, optionally, isolated from the host
cells and/or the nutrient medium. The present invention further includes the use of
polynucleotides encoding GDNFR and vectors containing such polynucleotides in
gene therapy or cell therapy.

The host cell may also be selected for its suitability to human implantation,
wherein the implanted cell expresses and secretes a neurotrophic factor receptor of the
present invention. The host cell also may be enclosed in a semipermeable membrane
suitable for human implantation. The host cell may be transformed or transfected ex
vivo. An exemplary device for treating nerve damage involves: (a) a semipermeable
membrane suitable for implantation; and (b) cells encapsulated within the membrane,
wherein the cells express and secrete a neurotrophic factor receptor as disclosed
herein. The membrane is selected from a material that is permeable to the neurotrophic
factor receptor protein but impermeable to materials detrimental to the encapsulated
cells.

Methods for the recombinant production of a neurotrophic factor receptor of
the present invention are also disclosed. An exemplary method involves: (a)
culturing a host cell containing a nucleic acid sequence encoding a neurotrophic factor
receptor of the present invention, such as an amino acid sequence depicted in Figures 2
and 4 (SEQ ID NOs. 2 and 4) capable of complexing with glial cell line-derived
neurotrophic factor (GDNF) and mediating cell response to GDNF, or biologically
equivalent analogs thereof; (b) maintaining said host cell under conditions suitable for
the expression of said neurotrophic factor receptor by said host cell; and (c) optionally,
isolating said neurotrophic factor receptor expressed by said host cell. The host cell
may be a prokaryotic cell or a eukaryotic cell. If bacterial expression is involved, the
method may further include the step of refolding the neurotrophic factor receptor.

The present invention includes an isolated and purified protein comprising an
amino acid sequence as depicted in Figures 2 and 4 (SEQ ID NOs. 2 and 4) capable of
complexing with glial cell line-derived neurotrophic factor (GDNF) and mediating cell
response to GDNF, and biologically equivalent analogs thereof. Exemplary analogs
include, but are not limited to, proteins comprising the amino acid sequence Ser ^
through Pro446, Asp25 through Leu447 and Cys29 through Cys442 as depicted in
Figure 2 (SEQ ID NO:2) as well as proteins comprising the amino acid sequence
Met17 through Pro449 and Cys29 through Cys443 as depicted in Figure 4 (SEQ ID
NO:4). The proteins of the present invention may be glycosylated or non-glycosylated
and may be produced by recombinant technology or chemical synthesis. The present
invention further includes nucleic acid sequences encoding a receptor protein
comprising such amino acid sequences.

Also disclosed herein are pharmaceutical compositions comprising a protein
receptor of the present invention in combination with a pharmaceutically acceptable
carrier. A variety of other formulation materials may be used to facilitate manufacture,
storage, handling, delivery and/or efficacy.

Another aspect of the present invention includes the therapeutic use of GDNFR
genes and proteins. For example, a circulating or soluble GDNFR protein product
may be used alone or in conjunction with GDNF in treating disease of or injury to the
nervous system by enhancing the activity of transmembrane signaling of GDNF.
Thus, the proteins and pharmaceutical compositions of the present invention may be
used in treating improperly functioning dopaminergic nerve cells, Parkinson's disease,
Alzheimer's disease and amyotrophic lateral sclerosis. Alternatively, a recombinant
GDNFR gene may be inserted in the cells of tissues which would benefit from
increased sensitivity to GDNF, such as motor neurons in patients suffering from
amyotrophic lateral sclerosis. In yet another embodiment, GDNFR may be used to
block GDNF activity in cases where GDNF activity is thought to be detrimental. The
GDNFR may be used to verify that observed effects of GDNF are due to the GDNFR.

In another aspect of the invention, GDNFR probes may be used to identify
cells and tissues which are responsive to GDNF in normal or diseased states.
Alternatively, the probes may be used to detect an aberrancy of GDNFR expression in
a patient suffering from a GDNF-related disorder.

In a further aspect of the invention, GDNFR probes, including nucleic acid as
well as antibody probes, may be used to identify GDNFR-related molecules. For
example, the present invention provides for such molecules which form a complex
with GDNFR and thereby participate in GDNFR function. As another example, the
present invention provides for receptor molecules which are homologous or
cross-reactive antigenically, but not identical to GDNFR.

The present invention also provides for the development of both binding and
functional assays for GDNF based on the receptor. For example, assay systems for
detecting GDNF activity may involve cells which express high levels of GDNFR, and
which are therefore extremely sensitive to even very low concentrations of GDNF or
GDNF-like molecules. In yet another embodiment, soluble GDNFR may be used to
bind or detect the presence of GDNF or GDNF-like molecules.

In addition, the present invention provides for experimental model systems for
studying the physiological role of GDNF. Such systems include assays involving
anti-GDNFR antibodies or oligonucleotide probes as well as animal models, such as
transgenic animals which express high levels of GDNFR and therefore are
hypersensitive to GDNF or animals derived using embryonic stem cell technology in
which the endogenous GDNFR genes were deleted from the genome. An
anti-GDNFR antibody will bind a peptide portion of the neurotrophic factor receptor
proteins. Antibodies include monoclonal and polyclonal antibodies. Alternatively,
immunological tags for which antibodies already exist may be attached to the GDNFR
protein to aid in detection. Such tags include but are not limited to Flag (BI/Eastman
Kodak) and myc sequences. Other tag sequences such as polyhistidine have also been
used for detection and purification on metal chelating columns.

Additional aspects and advantages of the invention will be apparent to those 
skilled in the art upon consideration of the following description, which details the 
practice of the present invention. 






BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 depicts a nucleic acid sequence (SEQ ID NO:1) encoding human glial
cell line-derived neurotrophic factor receptor (GDNFR). The amino acid sequence of
the full length GDNFR protein is encoded by nucleotides 540 to 1934.

Figure 2 depicts the amino acid sequence (SEQ ID NO:2) of the full length 
human GDNFR protein. 

Figure 3 depicts a nucleic acid sequence (SEQ ID NO:3) encoding rat GDNFR.
The amino acid sequence of the full length GDNFR protein is encoded by nucleotides
302 to 1705.

Figure 4 depicts the amino acid sequence (SEQ ID NO:4) of the full length rat
GDNFR protein.

Figure 5 depicts the alignment and comparison of portions of GDNFR cDNA
sequences produced in various clones as well as the consensus sequence for human
GDNFR.


Figure 6 depicts the identification of Neuro-2A derived cell lines expressing 
GDNFR. 

Figures 7A and 7B depict the results of the equilibrium binding of [125I]-GDNF
to cells expressing GDNFR.

Figure 8 depicts the results of the chemical cross-linking of [125I]-GDNF to
GDNFR and Ret expressed in cells expressing GDNFR.

Figure 9 depicts the results of the induction of c-Ret autophosphorylation by 
GDNF in cells expressing GDNFR. 

Figure 10 depicts the results of the induction of c-Ret autophosphorylation by 
GDNF and soluble GDNFR. 



Figure 11 depicts the results of the blocking of c-Ret autophosphorylation by a
Ret-Fc fusion protein.

Figure 12 depicts the results of the induction of c-Ret autophosphorylation by
GDNF in motor neurons.

Figure 13 depicts a model for GDNF signaling mediated by GDNFR and Ret.

DETAILED DESCRIPTION OF THE INVENTION 

Glial cell line-derived neurotrophic factor (GDNF) is a potent neurotrophic
factor which exhibits a broad spectrum of biological activities on a variety of cell types
from both the central and peripheral nervous systems. It is a glycosylated,
disulfide-linked dimer which is distantly related (less than 20% homology) to the
transforming growth factor-β (TGF-β) superfamily. GDNF's ability to enhance the
survival of dopaminergic neurons and other neuron populations demonstrates its
therapeutic potential for the treatment of Parkinson's disease as well as other forms of
nerve damage or malfunction.

In contrast to the extensive studies on the distribution and bioactivity of
GDNF, there have been no reports on the identification of a receptor or receptors
which mediate binding of GDNF to a cell and thereby mediate intracellular signaling
and a cell response. The present invention is based upon the discovery of a high
affinity receptor first found on the surface of cultured retinal cells from postnatal rats.
These receptors possess an estimated GDNF binding affinity comparable to that of the
receptors found in dopaminergic and motor neurons: midbrain dopaminergic neurons
(Lin et al., 1993 supra; Sauer et al., 1995 supra; Kearns and Gash, 1995 supra; Beck
et al., 1995 supra; Tomac et al., 1995a supra), and facial and spinal cord motor neurons
(Li et al., 1995 supra; Oppenheim et al., 1995 supra; Yan et al., 1995 supra; Zurn et
al., 1994 supra; Henderson et al., 1994 supra). The receptor molecule has been
named GDNF receptor (GDNFR) since it is the first known component of a receptor
system for GDNF. The present invention also provides the first description of the
expression cloning and characterization of GDNFR protein. Cells modified to express
the recombinant receptor bind GDNF with high affinity.

Using a dopamine uptake assay and [125I]-GDNF binding on cultured cells,
high affinity receptors to GDNF were detected on the surface of rat photoreceptor
cells. As further described in the Examples, the study of photoreceptor cells led to
the isolation of a cDNA clone by expression cloning for GDNF receptor. The nucleic
acid sequence for GDNFR encodes a protein of 468 amino acids with 31 cysteine
residues and three potential N-glycosylation sites. Next, a nucleic acid sequence from
the rat cDNA clone was used to isolate its human homolog, which was found to be
nearly identical to the rat receptor at the amino acid level. The human GDNFR cDNA
sequence encodes a protein of 465 amino acids with the positions of all cysteine
residues and potential N-glycosylation sites conserved relative to the rat receptor. This
high degree of primary sequence conservation indicated an important role for this
receptor in the biological function of GDNF.
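
The protein lengths stated above can be cross-checked against the coding
coordinates given in the descriptions of Figures 1 and 3 (positions 540 to 1934
for the human sequence and 302 to 1705 for the rat sequence). The following
sketch assumes that those coordinates are 1-based and inclusive and that they
span only the codons for the amino acids, without a stop codon; these are
assumptions of the sketch rather than statements made in the text. The function
name codon_count is introduced here for illustration.

```python
def codon_count(first_nt, last_nt):
    """Number of codons in a coding region given inclusive 1-based coordinates."""
    length = last_nt - first_nt + 1
    if length <= 0:
        raise ValueError("last_nt must not precede first_nt")
    if length % 3 != 0:
        raise ValueError("coding region length is not a multiple of three")
    return length // 3


# Coordinates as given for Figure 1 (human) and Figure 3 (rat).
human_codons = codon_count(540, 1934)
rat_codons = codon_count(302, 1705)

print("human GDNFR codons:", human_codons)
print("rat GDNFR codons:  ", rat_codons)
```

Under these assumptions the coordinates correspond to 465 codons for the human
sequence and 468 codons for the rat sequence, in agreement with the stated
protein lengths.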

As discussed above, many receptors have three main domains: an extracellular
or cell surface domain responsible for specifically binding a protein factor; a
transmembrane domain which spans the cell's membrane; and an intracellular or
cytoplasmic domain that is typically involved in initiating signal transduction when a
protein factor binds to the extracellular domain. It was determined, however, that
GDNFR is unrelated in sequence or structural characteristics to any known protein
(such as the consensus sequences found in either receptor kinases or cytokine
receptors), lacks a cytoplasmic domain, lacks the C-terminal charged residues
characteristic of a transmembrane domain and is anchored to the cell membrane by
glycosyl-phosphatidylinositol (GPI) linkage, as described in greater detail below.
Although the absence of an intracellular catalytic domain precluded a direct role in
transmembrane signaling, the high binding affinity and strong evolutionary sequence
conservation further suggested that this receptor was important for GDNF function.

Because GDNFR lacks a cytoplasmic domain, it was thought that this receptor
must act in conjunction with one or more accessory molecules which play a role in
transmembrane signaling. It was then discovered that transgenic mice which lack the
gene for GDNF die and have no kidneys. Transgenic mice which lack the gene for
c-ret proto-oncogene (Schuchardt et al., Nature, 367, 380-383, 1994) were found to
have a similar phenotype. The c-ret proto-oncogene encodes a receptor tyrosine kinase
(RTK) whose normal function had not yet been determined. All RTKs have a similar
topology: they possess an extracellular ligand-binding domain, a transmembrane
domain and a cytoplasmic segment containing the catalytic protein-tyrosine kinase
domain. Binding of a ligand leads to the activation of the kinase domain and
phosphorylation of specific substrates in the cell that mediate intracellular signaling.
The present invention involves the discovery that a soluble form of GDNFR may be
used to mediate the binding of GDNF to the c-ret proto-oncogene and thereby elicit a
cellular response to GDNF as well as modify its cell-type specificity.

Similar species, called "receptor alpha" components, provide ligand binding 
specificity but do not have the capacity to transduce signal on their own. Such 
components are found in the ciliary neurotrophic factor (CNTF) and interleukin-6 (IL-6) 
receptor systems. Like GDNFR, and in contrast to IL-6 receptor, CNTF receptor 
binds its ligand with high affinity, has a hydrophobic C-terminus, no cytoplasmic 
domain, and is anchored to the cell membrane by GPI linkage (Davis et al., 1991). In 
order to mediate signal transduction, CNTF binds first to CNTF receptor, creating a 
complex which is able to bind gp130. This inactive complex then binds to LIF 
receptor to form the active signaling complex (Davis, et al., Science, 260, 1805-1807, 
1993). As with the present invention, CNTF receptor (the ligand specific binding 
component) must be present for signaling to occur but it need not be membrane bound 
(Economides et al., Science, 270, 1351-1353, 1995). 

As further described below, GDNFR protein may be anchored to a cell 
surface, or it may be provided in a soluble form. In either case, GDNFR protein 
forms a ligand complex with GDNF, and the ligand complex binds to a cell surface 
receptor to effectuate intracellular signaling. Thus, a soluble form of GDNFR may be 
used to potentiate the action of GDNF and/or modify its cell-type specificity. 

GDNFR is unrelated to any known receptor. There are no apparent matches in 
the GenBank and Washington University-Merck databases for related sequences. An 
expressed sequence tag (EST) found in the Washington University-Merck EST 
database shows 75% homology to a small portion of the coding region of GDNFR 
(approximately 340 nucleotides of the 521 nucleotides of sequence generated from the 
5' end of the clone). This clone (GenBank accession #H12981) was isolated from an 
oligo-dT primed human infant brain library and cloned directionally into the Lafmid 
BA vector (Hillier, L. et al., unpublished data). The 3' end of the #H12981 clone has 
been sequenced, but it exhibits no homology to any part of the GDNFR. The 
appearance of homology between this #H12981 clone and GDNFR over a short 
region, which homology then disappears, suggests that the #H12981 clone represents 
an unspliced transcript or cloning artifact rather than a bona fide cDNA transcript. 

Thus, the present invention enables the cloning of GDNFR protein by 
providing a method for selecting target cells which express GDNFR. By providing a 
means of enriching for GDNFR encoding sequences, the present invention further 
provides for the purification of GDNFR protein and the direct cloning of GDNFR-encoding 
DNA. The present description of the GDNFR nucleic acid and amino acid 
sequences provides the information needed to reproduce these entities as well as a 
variety of GDNFR analogs. With this information, GDNFR protein products may be 
isolated or generated by any means known to those skilled in the art. A variety of 
means for the recombinant or synthetic production of GDNFR protein are disclosed. 

As used herein, the term "GDNFR protein product" includes biologically active 
purified natural, synthetic or recombinant GDNFR, GDNFR analogs (i.e., GDNFR 
homologs and variants involving insertion, substitution and deletion variations), and 
chemically modified derivatives thereof. GDNFR analogs are substantially 
homologous to the GDNFR amino acid sequences set forth in Figures 2 and 4 (SEQ 
ID NOs:2 and 4). 

The term "biologically active", as used herein, means that the GDNFR protein 
product demonstrates high affinity binding to GDNF and mediates or enhances 
GDNF-induced signal transduction. Using the present disclosure, it is well within the 
ability of those of ordinary skill in the art to determine whether a GDNFR polypeptide 
analog has substantially the same biological activity as the GDNFR protein products 
set forth in Figures 2 and 4. 

The term "substantially homologous" amino acid sequence, as used herein, 
refers to an amino acid sequence sharing a degree of "similarity" or homology to the 
GDNFR amino acid sequences set forth in Figures 2 and 4 such that the homologous 
sequence has a biological activity or function similar to that described for these 
GDNFR amino acid sequences. It will be appreciated by those skilled in the art that a 
relatively large number of individual or grouped amino acid residues can be changed, 
positionally exchanged (e.g., reverse ordered or reordered) or deleted entirely in an 
amino acid sequence without affecting the three dimensional configuration or activity 
of the molecule. Such modifications are well within the ability of one skilled in the art 
following the present disclosure. The identification and means of providing such 
modified sequences are described in greater detail below. It is preferable that the 
degree of homology of a substantially homologous protein (peptide) is equal to or in 
excess of 70% (i.e., a range of from 70% to 100% homology). Thus, a preferable 
"substantially homologous" GDNFR amino acid sequence may have a degree of 
homology greater than or equal to 70% of the amino acid sequences set forth in SEQ 
ID NOs:2 and 4. More preferably the degree of homology may be equal to or in 
excess of 85%. Even more preferably it is equal to or in excess of 90%, or most 
preferably it is equal to or in excess of 95%. 

The percentage of homology as described herein is calculated as the percentage 
of amino acid residues found in one protein sequence which align with identical or 
similar amino acid residues in the second protein sequence. Thus, in the case of 
GDNFR homology, the degree of sequence homology may be determined by 
optimally aligning the amino acid residues of the comparison molecule to those of a 
reference GDNFR polypeptide, such as depicted in SEQ ID NOs: 2 and 4 or those 
encoded by the nucleic acid sequences depicted in the Figures, to maximize matches of 
residues between the two sequences. It will be appreciated by those skilled in the art 
that such alignment may include appropriate conservative residue substitutions and will 
disregard truncations and internal deletions or insertions of the comparison sequence 
by introducing gaps as required; see, for example, Dayhoff, Atlas of Protein Sequence 
and Structure Vol. 5, wherein an average of three or four gaps in a length of 100 
amino acids may be introduced to assist in alignment (p. 124, National Biochemical 
Research Foundation, Washington, D.C., 1972; the disclosure of which is hereby 
incorporated by reference). Once so aligned, the percentage is determined by the 
number of aligned residues in the comparison polypeptide divided by the total number 
of residues in the comparison polypeptide. It is further contemplated that the GDNFR 
sequences of the present invention may be used to form a portion of a fusion protein or 
chimeric protein which has, at least in part, GDNFR activity. The alignment and 
homology of such a protein would be determined using that portion of the fusion 
protein or chimeric protein which is related to GDNFR activity. 
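
The percentage calculation described above may be sketched in Python. This is a minimal illustration under stated assumptions, not the procedure used to compare the disclosed sequences: the two sequences are assumed to be already optimally aligned, with gaps written as "-"; the groups of "similar" residues are hypothetical placeholders for whichever conservative substitutions are accepted; and the function names and toy sequences are illustrative only.

```python
# Hypothetical groups of residues treated as "similar" (conservative substitutions).
SIMILAR_GROUPS = [set("ILVM"), set("FYW"), set("KRH"), set("DE"), set("ST"), set("NQ")]


def is_match(a, b):
    """Return True if two residues are identical or share an assumed similarity group."""
    return a == b or any(a in g and b in g for g in SIMILAR_GROUPS)


def percent_homology(reference_aln, comparison_aln):
    """Percent of comparison residues aligned with identical or similar reference residues.

    Both arguments are pre-aligned strings of equal length using '-' for gaps.
    The denominator is the total number of residues in the comparison sequence.
    """
    if len(reference_aln) != len(comparison_aln):
        raise ValueError("aligned sequences must have equal length")
    total = sum(1 for c in comparison_aln if c != "-")
    if total == 0:
        raise ValueError("comparison sequence has no residues")
    matched = sum(
        1
        for r, c in zip(reference_aln, comparison_aln)
        if r != "-" and c != "-" and is_match(r, c)
    )
    return 100.0 * matched / total


def preference_tier(pct):
    """Map a percentage onto the preference levels stated in the text (70/85/90/95%)."""
    if pct >= 95:
        return "most preferred"
    if pct >= 90:
        return "even more preferred"
    if pct >= 85:
        return "more preferred"
    if pct >= 70:
        return "preferred"
    return "below 70%"


reference = "QSCSTKYRTL"   # toy aligned reference
comparison = "QSCA-KFRTL"  # toy comparison: one mismatch, one gap, one similar pair (Y/F)
pct = percent_homology(reference, comparison)
print(f"{pct:.1f}% -> {preference_tier(pct)}")
```

In this toy case eight of the nine comparison residues align with identical or similar reference residues. The gap position is excluded from the numerator but does not reduce the denominator, which counts only residues actually present in the comparison sequence.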

The sources of such substantially homologous GDNFR proteins include the 
GDNFR proteins of other mammals which are expected to have a high degree of 
homology to the human GDNFR protein. For example, the degree of homology 
between the rat and human GDNFR proteins disclosed herein is about 93%. 
Substantially homologous GDNFR proteins may be isolated from such mammals by 
virtue of cross-reactivity with antibodies to the GDNFR amino acid sequences of SEQ 
ID NOs: 2 and 4. Alternatively, they may be expressed by nucleic acid sequences 
which are isolated through hybridization with the gene or with segments of the gene 
encoding the GDNFR of SEQ ID NOs: 2 and 4 or which hybridize to a complementary 
sequence of the nucleic acid sequences illustrated in SEQ ID NOs: 2 and 4. Suitable 
hybridization conditions are described in further detail below. 

The novel GDNFR protein products are typically isolated and purified to form 
GDNFR protein products which are substantially free of unwanted substances that 
would detract from the use of the present polypeptides for an intended purpose. For 
example, preferred GDNFR protein products may be substantially free from the 
presence of other human (e.g., non-GDNFR) proteinaceous materials or pathological 
agents. Preferably, the GDNFR protein products are about 80% free of other proteins 
which may be present due to the production technique used in the manufacture of the 
GDNFR protein product. More preferably, the GDNFR protein products are about 
90% free of other proteins, particularly preferably, about 95% free of other proteins, 
and most preferably about >98% free of other proteins. In addition, the present 
invention furnishes the unique advantage of providing polynucleotide sequences for 
the manufacture of homogeneous GDNFR proteins. 

A variety of GDNFR variants are contemplated, including addition, deletion 
and substitution variants. For example, a series of deletion variants may be made by 
removing one or more amino acid residues from the amino and/or carboxy termini of 
the GDNFR protein. Using rules for the prediction of signal peptide cleavage as 
described by von Heijne (von Heijne, Nucleic Acids Research, 14, 4683-4690, 1986), 
the first amino acid residue of the GDNFR protein which might be involved in GDNF 
binding is Ser- as depicted in the full length amino acid sequence of human GDNFR 
in Figure 2 (SEQ ID NO:2). Amino acid residues Met 1 through Ser 18 are in the 
amino-terminal hydrophobic region that is likely to be part of a signal peptide 
sequence, and therefore, not be included in the mature form of the receptor protein. 
Similarly, the last amino acid residue of the GDNFR protein which is likely to be 
necessary for GDNF binding is Ser 446. Amino acid residues Leu 447 through Ser 465 
are in the carboxy-terminal hydrophobic region that is involved in the GPI linkage of 
the protein to the cell surface. Thus, it is contemplated that any or all of the residues 
from Met 1 through Ser 18 and/or Leu 447 through Ser 465 (as depicted in Figure 2 
(SEQ ID NO:2)) may be removed from the protein without affecting GDNF binding to 
the GDNFR protein, thereby leaving a "core" sequence of Ala 19 through Pro 446. 
Using known analysis techniques, it is further contemplated that N-terminal 
truncations may include the removal of one or more amino acid residues up to and 
including Gly 24. Thus, GDNFR truncation analogs also may include the deletion of 
one or more amino acid residues from either or both termini such that an amino acid 
sequence of Asp 25 through Pro 446 or Leu 447 forms the basis for a core molecule. 
Additional GDNFR analogs are contemplated as involving amino acid residues Ser 18 
through Pro 449 as depicted in the GDNFR amino acid sequence of Figure 4 (SEQ ID 
NO:4), i.e., deleting one or more amino acid residues from either or both termini 
involving the hydrophobic regions depicted as amino acid residues Met 1 through 
Ser 18 and/or Pro 449 through Ser 468. 
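
The residue arithmetic behind such truncation analogs may be sketched as follows. This is an illustrative sketch only: residue numbers are treated as 1-indexed and inclusive, the helper name `core_fragment` is hypothetical, and the 465-residue string is a placeholder standing in for a full length sequence, not the sequence of SEQ ID NO:2.

```python
def core_fragment(full_length, first_kept, last_kept):
    """Return residues first_kept..last_kept (1-indexed, inclusive) of a sequence."""
    if not (1 <= first_kept <= last_kept <= len(full_length)):
        raise ValueError("residue range lies outside the sequence")
    return full_length[first_kept - 1:last_kept]


# Placeholder for a 465-residue full length protein; NOT the disclosed sequence.
placeholder = "M" + "X" * 463 + "S"

# Remove residues 1-18 (amino-terminal region) and 447-465 (carboxy-terminal region).
core = core_fragment(placeholder, 19, 446)

# Further N-terminal truncation up to and including residue 24.
shorter_core = core_fragment(placeholder, 25, 446)

print(len(core), len(shorter_core))
```

Because the numbering is inclusive, a fragment running from residue 19 through residue 446 contains 446 - 19 + 1 residues.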

In addition, it is contemplated that one or more amino acid residues may be 
removed from either or both of the amino and carboxy termini until the first and last 
cysteine residues in the full length sequence are reached. It is advantageous to retain 
the cysteine residues for the proper intramolecular binding of the GDNFR protein. As 
depicted in the full length amino acid sequence of human GDNFR in Figure 2 (SEQ ID 
NO:2), any or all of amino acid residues from Met 1 to Asp 28 may be removed from 
the amino terminal without removing the first cysteine residue which appears as 
Cys 29. Similarly, any or all of amino acid residues from Gly 443 to Ser 465 may be 
removed from the carboxy terminal without removing the last cysteine residue which 
appears as Cys 442. Other GDNFR analogs may be made using amino acid residues 
Cys 29 through Cys 443 as depicted in the GDNFR amino acid sequence of Figure 4 
(SEQ ID NO:4), i.e., deleting all or part of the terminal regions depicted as amino acid 
residues Met 1 through Asp 28 and/or Ser 444 through Ser 468. 
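
Locating the terminal regions that may be removed while retaining every cysteine can be sketched as below. The function name `removable_termini` is hypothetical, and the test sequence is a synthetic placeholder built so that its first and last cysteines fall at positions 29 and 442 of 465 residues, matching the human numbering given above; it is not the disclosed sequence.

```python
def removable_termini(sequence):
    """Return the 1-indexed, inclusive terminal ranges lying outside the cysteines.

    The first range ends just before the first cysteine; the second starts just
    after the last cysteine. A range is None when no residues can be removed.
    """
    if "C" not in sequence:
        raise ValueError("sequence contains no cysteine")
    first_cys = sequence.index("C") + 1   # 1-indexed position of first Cys
    last_cys = sequence.rindex("C") + 1   # 1-indexed position of last Cys
    n_term = (1, first_cys - 1) if first_cys > 1 else None
    c_term = (last_cys + 1, len(sequence)) if last_cys < len(sequence) else None
    return n_term, c_term


# Synthetic placeholder: Cys at positions 29 and 442 in a 465-residue string.
placeholder = "M" + "A" * 27 + "C" + "A" * 412 + "C" + "G" + "A" * 21 + "S"

n_term, c_term = removable_termini(placeholder)
print(n_term, c_term)
```

For this placeholder the removable ranges correspond to residues 1 through 28 and 443 through 465, the same boundaries stated for the human sequence.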

It will be appreciated by those skilled in the art that, for the same reasons, it is 
contemplated that these identified amino acid residues may be replaced, rather than 
deleted, without affecting the function of the GDNFR protein. Alternatively, these 
identified amino acid residues may be modified by intra-residue insertions or terminal 
additions without affecting the function of the GDNFR protein. In yet another 
embodiment, a combination of one or more deletions, substitutions or additions may 
be made. 

The present GDNFR proteins or nucleic acids may be used for methods of 
treatment, or for methods of manufacturing medicaments for treatment. Such 
treatment includes conditions characterized by excessive production of GDNF 
protein, wherein the present GDNFRs, particularly in soluble form, may be used to 
complex to and therefore inactivate such excessive GDNF protein. This treatment may 
be accomplished by preparing soluble receptor (e.g., use of the GDNF binding 
domain) or by preparation of a population of cells containing such GDNFR, and 
transplanting such cells into the individual in need thereof. The present GDNFR 
protein products may also be used for treatment of those having defective GDNF 
receptors. For example, one may treat an individual having defective GDNFRs by 
preparation and delivery of a soluble receptor, or by preparation of a population of 
cells containing such non-defective GDNFR and transplanting such cells into an 
individual. Or, an individual may have an inadequate number of GDNF receptors, and 
cells containing such receptors may be transplanted in order to increase the number of 
GDNF receptors available to an individual. Such compositions may be used in 
conjunction with the delivery of GDNF. It is also contemplated that GDNFR protein 
products may be used in the treatment of conditions responsive to the activation of the 
c-ret receptor tyrosine kinase. 

In yet another aspect of the present invention, a further advantage to the novel 
compositions is the use of GDNFR to stabilize GDNF protein pharmaceutical 
compositions. In another aspect of the present invention, a GDNFR may be used to 
screen compounds for antagonist activity. 

Other aspects and advantages of the present invention will be apparent to those 
skilled in the art. For example, additional uses include new assay systems, transgenic 
animals and antibody production. 

Study Models 

The present invention provides for assay systems in which GDNF activity or 
activities similar to GDNF activity resulting from exposure to a peptide or non-peptide 
compound may be detected by measuring an elicited physiological response in a cell or 
cell line which expresses the GDNFR molecules of the present invention. A 
physiological response may comprise any of the biological effects of GDNF, 
survival or growth, as well as the transcriptional activation of certain nucleic acid 
sequences (e.g., promoter/enhancer elements as well as structural genes), GDNF-related 
processing, translation, or phosphorylation, and the induction of secondary 
processes in response to processes directly or indirectly induced by GDNF, to name 
but a few. 

For example, a model system may be created which may be used to study the 
effects of excess GDNF activity. In such a system, the response of a cell to GDNF 
may be increased by engineering an increased number of GDNFRs on the cells of the 
model system relative to cells which have not been so modified. A system may also be 
developed to selectively provide an increased number of GDNFRs on cells which 
normally express GDNFRs. In order to ensure expression of GDNFR, the GDNFR 
gene may be placed under the control of a suitable promoter sequence. It may be 
desirable to put the GDNFR gene under the control of a constitutive and/or tissue 
specific promoter (including but not limited to the CNS neuron specific enolase, 
neurofilament, and tyrosine hydroxylase promoters), an inducible promoter (such as the 
metallothionein promoter), the UV activated promoter in the human immunodeficiency 
virus long terminal repeat (Valeri et al., 1988, Nature 333:78-81), or the CMV 
promoter (as contained in pCMX, infra) or a developmentally regulated promoter. 
By increasing the number of cellular GDNFRs, the response to endogenous 
GDNF may be increased. If the model system contains little or no GDNF, GDNF 
may be added to the system. It may also be desirable to add additional GDNF to the 
model system in order to evaluate the effects of excess GDNF activity. Overexpressing 
GDNF (or secreted GDNF) may be one method for studying the effects of 
elevated levels of GDNF on cells already expressing GDNFR. 

GDNFR Therapies 

In another aspect, certain conditions may benefit from an increase in GDNF 
responsiveness. It may, therefore, be beneficial to increase the number or binding 
affinity of GDNFRs in patients suffering from conditions responsive to GDNF 
therapy. This could be achieved through gene therapy, whereby selective expression 
of recombinant GDNFR in appropriate cells is achieved, for example, by using 
GDNFR genes controlled by tissue specific or inducible promoters or by producing 
localized infection with replication defective viruses carrying a recombinant GDNFR 
gene. 

It is envisioned that conditions which will benefit from GDNFR or combined 
GDNF/GDNFR delivery include, but are not limited to, motor neuron disorders 
including amyotrophic lateral sclerosis, neurological disorders associated with 
diabetes, Parkinson's disease, Alzheimer's disease, and Huntington's chorea. 
Additional indications for the use of GDNFR or combined GDNF/GDNFR delivery 
are described above and further include the treatment of: glaucoma or other diseases 
and conditions involving retinal ganglion cell degeneration; sensory neuropathy caused 
by injury to, insults to, or degeneration of, sensory neurons; pathological conditions, 
such as inherited retinal degenerations and age, disease or injury-related retinopathies, 
in which photoreceptor degeneration occurs and is responsible for vision loss; and 
injury or degeneration of inner ear sensory cells, such as hair cells and auditory 
neurons, for preventing and/or treating hearing loss due to a variety of causes. 

Transgenic Animals 

In yet another aspect, a recombinant GDNFR gene may be used to inactivate or 
"knock out" the endogenous gene (e.g., by homologous recombination) and thereby 
create a GDNFR deficient cell, tissue, or animal. For example, a recombinant 
GDNFR gene may be engineered to contain an insertional mutation which inactivates 
GDNFR. Such a construct, under the control of a suitable promoter, may be 
introduced into a cell, such as an embryonic stem cell, by any conventional technique 
including transfection, transduction, injection, etc. Cells containing the construct may 
then be selected, for example by G418 resistance. Cells which lack an intact GDNFR 
gene are then identified (e.g., by Southern blotting or Northern blotting or assay of 
expression). Cells lacking an intact GDNFR gene may then be fused to early embryo 
cells to generate transgenic animals deficient in GDNFR. A comparison of such an 
animal with an animal not expressing endogenous GDNF would reveal that either the 
two phenotypes match completely or that they do not, implying the presence of 
additional GDNF-like factors or receptors. Such an animal may be used to define 
specific neuronal populations, or other in vivo processes, normally dependent upon 
GDNF. Thus, these populations or processes may be expected to be affected if the 
animal did not express GDNFR, and therefore, could not respond to GDNF. 



Diagnostic Applications 

According to the present invention, GDNFR probes may be used to identify 
cells and tissues which are responsive to GDNF in normal or diseased states. The 
present invention provides for methods for identifying cells which are responsive to 
GDNF by detecting GDNFR expression in such cells. GDNFR expression may be 
evidenced by transcription of GDNFR mRNA or production of GDNFR protein. 
GDNFR expression may be detected using probes which identify GDNFR nucleic acid 
or protein or by detecting "tag" sequences artificially added to the GDNFR protein. 

One variety of probe which may be used to detect GDNFR expression is a 
nucleic acid probe, which may be used to detect GDNFR-encoding RNA by any 
method known in the art, including, but not limited to, in situ hybridization, Northern 
blot analysis, or PCR related techniques. Nucleic acid products of the invention may 
be labeled with detectable markers (such as radiolabels and non-isotopic labels such as 
biotin) and employed in hybridization processes to locate the human GDNFR gene 
position and/or the position of any related gene family in a chromosomal map. They 
may also be used for identifying human GDNFR gene disorders at the DNA level and 
used as gene markers for identifying neighboring genes and their disorders. 
Contemplated herein are kits containing such labeled materials. 

Polypeptide products of the invention may be "labeled" by association with a 
detectable marker substance or label (e.g., a radioactive isotope, a fluorescent or 
chemiluminescent chemical, an enzyme or other label available to one skilled in the art) 
to provide reagents useful in detection and quantification of GDNF in solid tissue and 
fluid samples such as blood or urine. Such products may also be used in detecting 
cells and tissues which are responsive to GDNF in normal or diseased states. 

Another possible assay for detecting the presence of GDNF in a test sample or 
screening for the presence of a GDNF-like molecule involves contacting the test 
sample with a GDNFR peptide immobilized on a solid phase, thereby producing 
GDNFR-bound GDNF. The GDNFR-bound GDNF may optionally be contacted 
with a detection reagent, such as a labeled antibody specific for GDNF, thereby 
forming a detectable product. Such assays may be developed in the form of assay 
devices for analyzing a test sample. In a basic form, such devices include a solid 
phase containing or coated with GDNFR. A method for analyzing a test sample for 
the presence of GDNF may involve contacting the sample to an assay reagent 
comprising GDNFR protein, wherein said GDNFR protein reacts with GDNF present 
in the test sample and produces a detectable reaction product indicative of the presence 
of GDNF. 

The assay reagents provided herein may also be embodied as part of a kit or 
article of manufacture. Contemplated is an article of manufacture comprising a 
packaging material and one or more preparations of the presently provided nucleic acid 
or amino acid sequences. Such packaging material will comprise a label indicating that 
the preparation is useful for detecting GDNF, GDNFR or GDNFR defects in a 
biological sample. As such, the kit may optionally include materials to carry out such 
testing, such as reagents useful for performing protein analysis, DNA or RNA 
hybridization analysis, or PCR analysis on blood, urine, or tissue samples. 

Anti-GDNFR Antibody 

According to the present invention, GDNFR protein, or fragments or 
derivatives thereof, may be used as an immunogen to generate anti-GDNFR 
antibodies. To further improve the likelihood of producing an anti-GDNFR immune 
response, the amino acid sequence of GDNFR may be analyzed in order to identify 
portions of the molecule which may be associated with increased immunogenicity. 
For example, the amino acid sequence may be subjected to computer analysis to 
identify surface epitopes which present computer-generated plots of hydrophilicity, 
surface probability, flexibility, antigenic index, amphiphilic helix, amphiphilic sheet, 
and secondary structure of GDNFR. Alternatively, the amino acid sequences of 
GDNFR from different species could be compared, and relatively non-homologous 
regions identified; these non-homologous regions would be more likely to be 
immunogenic across various species. 
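
One of the analyses listed above, a hydrophilicity plot, can be sketched as a sliding-window average. The text does not name a scale or program, so this sketch assumes the Kyte-Doolittle hydropathy scale (lower values indicate more hydrophilic residues) and an arbitrary window size; the function names are hypothetical. The example input is the SJP-7 peptide sequence listed in Table 1.

```python
# Kyte-Doolittle hydropathy values (assumed scale; lower = more hydrophilic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=7):
    """Mean hydropathy of each window of consecutive residues along the sequence."""
    seq = sequence.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[aa] for aa in seq]
    return [sum(scores[i:i + window]) / window for i in range(len(seq) - window + 1)]


def most_hydrophilic_window(sequence, window=7):
    """Return (1-indexed start residue, mean score) of the most hydrophilic window."""
    profile = hydropathy_profile(sequence, window)
    start = min(range(len(profile)), key=profile.__getitem__)
    return start + 1, profile[start]


# SJP-7 peptide from Table 1, scanned with an assumed 5-residue window.
start, mean_score = most_hydrophilic_window("CKRGMKKEKN", window=5)
print(start, round(mean_score, 2))
```

Stretches with strongly negative averages are candidate surface-exposed regions; a fuller analysis would combine this with the other indices named above.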

Also comprehended are polypeptide fragments duplicating only a part of the 
continuous amino acid sequence or secondary conformations within GDNFR, which 
fragments may possess one activity of mammalian GDNFR (e.g., immunological 
activity) and not others (e.g., GDNF protein binding activity). Thus, the production 
of antibodies can include the production of anti-peptide antibodies. The following 
exemplary peptides were synthesized using GDNFR sequences: 

TABLE 1 
GDNFR Peptides 

SJP-6    H2N-QSCSTKYRTL-COOH        human GDNFR, AA 40-49     (SEQ ID NO:25) 
SJP-7    H2N-CKRGMKKEKN-COOH        human GDNFR, AA 89-98     (SEQ ID NO:26) 
SJP-8    H2N-LLEDSPYEPV-COOH        human GDNFR, AA 115-124   (SEQ ID NO:27) 
SJP-9    H2N-CSYEERERPN-COOH        rat GDNFR, AA 233-242     (SEQ ID NO:28) 
SJP-10   H2N-PAPPVQTTTATTTT-COOH    rat GDNFR, AA 356-369     (SEQ ID NO:29) 

Peptides SJP-6, 7, and 8 are identical in rat and human GDNFR. Peptides SJP-9 and 
10 are derived from the rat sequence and are each one amino acid different from 
human. Both polyclonal and monoclonal antibodies may be made by methods known 
in the art using these peptides or other portions of GDNFR. 
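
As a small consistency check on Table 1, the length of each peptide can be compared with its stated residue range, treating the ranges as inclusive. The data below are transcribed from Table 1; the dictionary layout and function name are illustrative assumptions.

```python
# (sequence, first residue, last residue) transcribed from Table 1.
PEPTIDES = {
    "SJP-6": ("QSCSTKYRTL", 40, 49),
    "SJP-7": ("CKRGMKKEKN", 89, 98),
    "SJP-8": ("LLEDSPYEPV", 115, 124),
    "SJP-9": ("CSYEERERPN", 233, 242),
    "SJP-10": ("PAPPVQTTTATTTT", 356, 369),
}


def range_matches_length(sequence, first, last):
    """True if an inclusive residue range spans exactly len(sequence) residues."""
    return last - first + 1 == len(sequence)


consistency = {
    name: range_matches_length(seq, first, last)
    for name, (seq, first, last) in PEPTIDES.items()
}
print(consistency)
```

Each listed range spans exactly as many residues as its peptide contains: ten for SJP-6 through SJP-9 and fourteen for SJP-10.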
Monoclonal antibodies directed against GDNFR may be prepared by any 
known technique which provides for the production of antibody molecules by 
continuous cell lines in culture. For example, the hybridoma technique originally 
developed by Kohler and Milstein to produce monoclonal antibodies (Nature, 
256:495-497, 1975), as well as the trioma technique, the human B-cell hybridoma 
technique (Kozbor et al., Immunology Today 4:72, 1983), the EBV-hybridoma 
technique (Cole et al., in "Monoclonal Antibodies and Cancer Therapy," Alan R. Liss, 
Inc. pp. 77-96, 1985), and the like, may be used. 

Human monoclonal antibodies or chimeric human-mouse (or other species) 
monoclonal antibodies also may be prepared for therapeutic use and may be made by 
any of numerous techniques known in the art (e.g., Teng et al., Proc. Natl. Acad. Sci. 
U.S.A., 80:7308-7312, 1983; Kozbor et al., Immunology Today, 4:72-79, 1983; 
Olsson et al., Meth. Enzymol., 92:3-16, 1982). Chimeric antibody molecules may be 
prepared containing a mouse antigen-binding domain with human constant regions 
(Morrison et al., Proc. Natl. Acad. Sci. U.S.A., 81:6851, 1984; Takeda et al., 
Nature, 314:452, 1985). 

Various procedures known in the art also may be used for the production of 
polyclonal antibodies. For the production of antibody, various host animals including, 
but not limited to, rabbits, mice, rats, etc., can be immunized by injection with 
GDNFR protein, or a fragment or derivative thereof. Various adjuvants may be used 
to increase the immunological response, depending on the host species selected. 
Useful adjuvants include, but are not limited to, Freund's (complete and incomplete), 
mineral gels such as aluminum hydroxide, surface active substances such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanins, dinitrophenol, and human adjuvants such as BCG (Bacille Calmette-Guerin) 
and Corynebacterium parvum. 

A molecular clone of an antibody to a GDNFR epitope also may be prepared 
by known techniques. Recombinant DNA methodology (see e.g., Maniatis et al., 
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, N.Y., 1982) may be used to construct nucleic acid sequences which 
encode a monoclonal antibody molecule, or antigen binding region thereof. 

Antibody molecules may be purified by known techniques, e.g., 
immunoabsorption or immunoaffinity chromatography, chromatographic methods 
such as high performance liquid chromatography, or a combination thereof, etc. The 
present invention provides for antibody molecules as well as fragments of such 
antibody molecules. Antibody fragments which contain the idiotype of the molecule 
can be generated by known techniques. For example, such fragments include but are 
not limited to: the F(ab')2 fragment which can be produced by pepsin digestion of the 
antibody molecule; the Fab' fragments which can be generated by reducing the 
disulfide bridges of the F(ab')2 fragment; and the Fab fragments which can be 
generated by treating the antibody molecule with papain and a reducing agent. 

Such selective binding molecules may themselves be alternatives to GDNFR 
protein, and may be formulated as a pharmaceutical composition. 



Recombinant Expression of GDNFR Protein 

The present invention provides various polynucleotides encoding GDNFR 
proteins. The expression product or a derivative thereof is characterized by the ability 
to bind to GDNF specifically and with high affinity so that further interactions with 
signaling molecules can occur, thereby providing or enhancing GDNF activity such as 
increasing dopamine uptake by dopaminergic cells. The polynucleotides may also be 
used in cell therapy or gene therapy applications. 

According to the present invention, novel GDNFR protein and DNA sequences 
coding for all or part of such receptors are provided. Novel nucleic acid sequences of 
the invention include sequences useful in securing expression in procaryotic or 
eucaryotic host cells of polypeptide products having at least a part of the primary 
structural conformation and one or more of the biological properties of recombinant 
human GDNFR. The nucleic acids may be purified and isolated, so that the desired 
coding region is useful to produce the present polypeptides. Alternatively, the nucleic 
acid sequence may be used for diagnostic purposes, as described more fully below. 
Exemplary DNA sequences of the present invention comprise nucleic acid sequences 
encoding the GDNFR amino acid sequences depicted in Figures 2 and 4 and set forth 
in SEQ ID NOs:2 and 4. In addition, DNA sequences disclosed by the present 
invention specifically comprise: (a) any of the DNA sequences depicted in Figures 1 
and 3 (and complementary strands); (b) a DNA sequence which hybridizes (under 
hybridization conditions disclosed in the cDNA library screening section below, or 
equivalent conditions or more stringent conditions) to the DNA sequence in subpart (a) 
or to fragments thereof; and (c) a DNA sequence which, but for the degeneracy of the 
genetic code, would hybridize to the DNA sequence in subpart (a). Specifically 
comprehended in parts (b) and (c) are genomic DNA sequences encoding allelic variant 
forms of human GDNFR and/or encoding GDNFR from other mammalian species, 
and manufactured DNA sequences encoding GDNFR, fragments of GDNFR, and 
analogs of GDNFR which DNA sequences may incorporate codons facilitating 
transcription and translation of messenger RNA in microbial hosts. Such 
manufactured sequences may readily be constructed according to the methods known 
in the art as well as the methods described herein. 

Recombinant expression techniques, conducted in accordance with the 
descriptions set forth herein or other known methods, may be used to produce these 
polynucleotides and express the various GDNFR proteins. For example, by inserting 
a nucleic acid sequence which encodes a GDNFR protein into an appropriate vector, 
one skilled in the art can readily produce large quantities of the desired nucleotide 
sequence. The sequences can then be used to generate detection probes or 
amplification primers. Alternatively, a polynucleotide encoding a GDNFR protein can 
be inserted into an expression vector. By introducing the expression vector into an 
appropriate host, the desired GDNFR protein may be produced in large amounts. 

As further described herein, there are numerous host/vector systems available 
for the propagation of nucleic acid sequences and/or the production of GDNFR 
proteins. These include, but are not limited to, plasmid, viral and insertional vectors, 
and prokaryotic and eukaryotic hosts. One skilled in the art can adapt a host/vector 
system which is capable of propagating or expressing heterologous DNA to produce 
or express the sequences of the present invention. 

By means of such recombinant techniques, the GDNFR proteins of the present 
invention are readily produced in commercial quantities with greater purity. 
Furthermore, it will be appreciated by those skilled in the art that, in view of the 
present disclosure, the novel nucleic acid sequences include degenerate nucleic acid 
sequences encoding the GDNFR proteins specifically set forth in the Figures, 
sequences encoding variants of GDNFR proteins, and those nucleic acid sequences 
which hybridize, preferably under stringent hybridization conditions, to complements 
of these nucleic acid sequences (see Maniatis et al., Molecular Cloning (A Laboratory 
Manual); Cold Spring Harbor Laboratory, pages 387 to 389, 1982). Exemplary 
stringent hybridization conditions are hybridization in 4 x SSC at 62-67°C, followed 
by washing in 0.1 x SSC at 62-67°C for approximately an hour. Alternatively, 
exemplary stringent hybridization conditions are hybridization in 45-55% formamide, 
4 x SSC at 40-45°C. DNA sequences which hybridize to the complementary 
sequences for GDNFR protein under relaxed hybridization conditions and which 
encode a GDNFR protein of the present invention are also included herein. Examples 
of such relaxed stringency hybridization conditions are 4 x SSC at 45-55°C or 
hybridization with 30-40% formamide at 40-45°C. 
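
The SSC concentrations recited above are multiples of a single stock solution. As a minimal sketch, assuming the conventional definition of 1 x SSC as 0.15 M NaCl and 0.015 M sodium citrate (an assumption that is not stated in this disclosure), the salt molarity of any dilution may be computed as follows. The function name ssc_molarity is illustrative only.

```python
# Assumption: 1 x SSC = 0.15 M NaCl + 0.015 M sodium citrate (conventional definition,
# not taken from this disclosure).
SSC_1X_NACL_M = 0.15
SSC_1X_CITRATE_M = 0.015


def ssc_molarity(fold):
    """Return (NaCl molarity, sodium citrate molarity) for a given fold strength of SSC."""
    if fold <= 0:
        raise ValueError("fold must be positive")
    return (round(fold * SSC_1X_NACL_M, 6), round(fold * SSC_1X_CITRATE_M, 6))


hybridization_salts = ssc_molarity(4)     # 4 x SSC, as used for hybridization
stringent_wash_salts = ssc_molarity(0.1)  # 0.1 x SSC, as used for the stringent wash
```

Lower fold strength means lower ionic strength, which is why the 0.1 x SSC wash is more stringent than the 4 x SSC hybridization at the same temperature.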

Preparation of Polynucleotides Encoding GDNFR 

Based upon the disclosure of the present invention, a nucleic acid sequence 
encoding a full length GDNFR polypeptide or a fragment thereof may readily be 
prepared or obtained in a variety of ways, including, without limitation, chemical 
synthesis, cDNA or genomic library screening, expression library screening, and/or 
PCR amplification of cDNA. These methods and others useful for preparing nucleic 
acid sequences are known in the art and are set forth, for example, by Sambrook et al. 
(Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY, 1989), by Ausubel et al., eds. (Current Protocols in 
Molecular Biology, Current Protocols Press, 1994), and by Berger and Kimmel 
(Methods in Enzymology: Guide to Molecular Cloning Techniques, vol. 152, 
Academic Press, Inc., San Diego, CA, 1987). Preferred nucleic acid sequences 
encoding GDNFR are mammalian sequences. 

Chemical synthesis of a nucleic acid sequence which encodes a GDNFR 
protein can also be accomplished using methods known in the art, such as those set 
forth by Engels et al. (Angew. Chem. Intl. Ed., 28:716-734, 1989). These methods 
include, inter alia, the phosphotriester, phosphoramidite and H-phosphonate methods 
of nucleic acid sequence synthesis. A preferred method for such chemical synthesis is 
polymer-supported synthesis using standard phosphoramidite chemistry. Typically, 
the DNA encoding the desired polypeptide will be several hundred base pairs (bp) or 
nucleotides in length. Nucleic acid sequences larger than about 100 nucleotides can be 
synthesized as several fragments using these methods. The fragments can then be 
ligated together to form a sequence for the expression of a full length GDNFR 
polypeptide or a portion thereof. 
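
The fragment-wise synthesis described above can be sketched as follows. This is a simplified illustration that assumes blunt, non-overlapping fragments of at most 100 nucleotides; practical gene assembly generally uses overlapping or cohesive-ended fragments, which are not modeled here. The function names and the placeholder sequence are hypothetical and are not GDNFR sequence.

```python
def split_for_synthesis(sequence, max_len=100):
    """Split a sequence into consecutive fragments no longer than max_len nucleotides."""
    if max_len <= 0:
        raise ValueError("max_len must be positive")
    sequence = sequence.upper()
    return [sequence[i:i + max_len] for i in range(0, len(sequence), max_len)]


def ligate(fragments):
    """Join fragments in order, standing in for the ligation step."""
    return "".join(fragments)


# Placeholder 300-nucleotide sequence used only for illustration.
example_sequence = "ATGC" * 75
fragments = split_for_synthesis(example_sequence)
reassembled = ligate(fragments)
```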

Alternatively, a suitable nucleic acid sequence may be obtained by screening an 
appropriate cDNA library (i.e., a library prepared from one or more tissue source(s) 
believed to express the protein) or a genomic library (a library prepared from total 
genomic DNA). The source of the cDNA library is typically a tissue that is believed to 
express GDNFR in reasonable quantities. Typically, the source of the genomic library 
is any tissue or tissues from a mammalian species believed to harbor a gene encoding 
GDNFR. The library can be screened for the presence of the GDNFR cDNA/gene 
using one or more nucleic acid probes (such as oligonucleotides, cDNA or genomic 
DNA fragments based upon the presently disclosed sequences) that will hybridize 
selectively with GDNFR cDNA(s) or gene(s) present in the library. The probes 
used for such library screening typically encode a small region of the GDNFR 
nucleic acid sequence from the same or a similar species as the species from which the 
library was prepared. Alternatively, the probes may be degenerate, as discussed 
herein. 

Library screening is typically accomplished by annealing the oligonucleotide 
probe or cDNA to the clones in the library under conditions of stringency that prevent 
non-specific binding but permit binding (hybridization) of those clones that have a 
significant level of homology with the probe or primer. Typical hybridization and 
washing stringency conditions depend in part on the size (i.e., number of nucleotides 
in length) of the cDNA or oligonucleotide probe, and whether the probe is degenerate. 
The probability of obtaining a clone(s) is also considered in designing the 
hybridization solution (e.g., whether a cDNA or genomic library is being screened; if 
it is a cDNA library, the probability that the cDNA of interest is present at a high 
level). 

Where DNA fragments (such as cDNAs) are used as probes, typical 
hybridization conditions include those as set forth in Ausubel et al., eds., supra. After 
hybridization, the blot containing the library is washed at a suitable stringency, 
depending on several factors such as probe size, expected homology of probe to clone, 
type of library being screened, number of clones being screened, and the like. 
Examples of stringent washing solutions (which are usually low in ionic strength and 
are used at relatively high temperatures) are as follows. One such stringent wash is 
0.015 M NaCl, 0.005 M NaCitrate and 0.1% SDS at 55-65°C. Another such stringent 
buffer is 1 mM Na2EDTA, 40 mM NaHPO4, pH 7.2, and 1% SDS at about 40-50°C. 
One other stringent wash is 0.2 x SSC and 0.1% SDS at about 50-65°C. 

There are also exemplary protocols for stringent washing conditions where 
oligonucleotide probes are used to screen cDNA or genomic libraries. For example, a 
first protocol uses 6 x SSC with 0.05 percent sodium pyrophosphate at a temperature 
of between about 35 and 62°C, depending on the length of the probe. For example, 14 
base probes are washed at 35-40°C, 17 base probes at 45-50°C, 20 base probes at 
52-57°C, and 23 base probes at 57-63°C. The temperature can be increased 2-3°C where 
the background non-specific binding appears high. A second protocol uses 
tetramethylammonium chloride (TMAC) for washing. One such stringent washing 
solution is 3 M TMAC, 50 mM Tris-HCl, pH 8.0, and 0.2% SDS. 
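
The probe-length-dependent wash temperatures of the first protocol can be captured in a small lookup table. The following is a minimal sketch; the table values are taken from the text above, while the function name and the way the 2-3°C increase is applied (2°C added to the lower bound and 3°C to the upper bound) are assumptions made for illustration.

```python
# Wash temperature ranges (°C) in 6 x SSC / 0.05% sodium pyrophosphate,
# keyed by oligonucleotide probe length in bases, as recited in the text.
WASH_RANGES_C = {
    14: (35, 40),
    17: (45, 50),
    20: (52, 57),
    23: (57, 63),
}


def wash_temperature_range(probe_length, high_background=False):
    """Return the (low, high) wash temperature range in °C for a listed probe length."""
    if probe_length not in WASH_RANGES_C:
        raise KeyError(f"no wash range listed for a {probe_length}-base probe")
    low, high = WASH_RANGES_C[probe_length]
    if high_background:
        # Assumption: shift the range upward by the 2-3 °C increase mentioned in the text.
        return (low + 2, high + 3)
    return (low, high)


standard_17mer = wash_temperature_range(17)
raised_20mer = wash_temperature_range(20, high_background=True)
```

Probe lengths not listed in the text raise an error rather than being interpolated, since the text gives no rule for intermediate lengths.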

Another suitable method for obtaining a nucleic acid sequence encoding a 
GDNFR protein is by polymerase chain reaction (PCR). In this method, 
poly(A)+RNA or total RNA is extracted from a tissue that expresses GDNFR. A 
cDNA is then prepared from the RNA using the enzyme reverse transcriptase (i.e., 
RT-PCR). Two primers (oligonucleotides), typically complementary to two separate 
regions of the GDNFR cDNA, are then added to the cDNA along with a 
polymerase such as Taq polymerase, and the polymerase amplifies the cDNA region 
between the two primers. 
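
The region amplified between two primers can be predicted computationally. The following is a minimal sketch that assumes exact primer matches and a single binding site for each primer; the template and primer sequences are placeholders invented for illustration and are not GDNFR sequences.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence."""
    return sequence.upper().translate(COMPLEMENT)[::-1]


def amplify(template, forward_primer, reverse_primer):
    """Return the region of the sense-strand template bounded by the two primers, or None.

    The forward primer matches the sense strand directly. The reverse primer
    anneals to the sense strand, so its reverse complement is what appears
    in the template sequence.
    """
    template = template.upper()
    forward = forward_primer.upper()
    reverse_site = reverse_complement(reverse_primer)
    start = template.find(forward)
    if start == -1:
        return None
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]


# Placeholder template: flank + forward site + internal region + reverse site + flank.
template = "GGG" + "ATGCCTTA" + "AGGCCATTT" + "GACCGGTA" + "ACCC"
forward_primer = "ATGCCTTA"
reverse_primer = reverse_complement("GACCGGTA")
amplicon = amplify(template, forward_primer, reverse_primer)
```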

Where the method of choice for preparing the nucleic acid sequence encoding 
the desired GDNFR protein requires the use of oligonucleotide primers or probes 
(e.g., PCR, cDNA or genomic library screening), the oligonucleotide sequences 
selected as probes or primers should be of adequate length and sufficiently 
unambiguous so as to minimize the amount of non-specific binding that will occur 
during library screening or PCR amplification. The actual sequence of the probes or 
primers is usually based on conserved or highly homologous sequences or regions 
from the same or a similar gene from another organism, such as the rat nucleic acid 
sequence involved in the present invention. Optionally, the probes or primers can be 
fully or partially degenerate, i.e., contain a mixture of probes/primers, all encoding the 
same amino acid sequence, but using different codons to do so. An alternative to 
preparing degenerate probes is to place an inosine in some or all of those codon 
positions that vary by species. The oligonucleotide probes or primers may be prepared 
by chemical synthesis methods for DNA as described above. 
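
A fully degenerate probe set, i.e., every oligonucleotide that encodes a given amino acid sequence, can be enumerated from the standard genetic code. The following is a minimal sketch assuming the standard code and coding-strand sequences; the peptide "MWK" is a hypothetical example and is not taken from the GDNFR sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Invert the table: amino acid -> list of codons encoding it.
CODONS_FOR = {}
for codon, amino_acid in CODON_TABLE.items():
    CODONS_FOR.setdefault(amino_acid, []).append(codon)


def degenerate_probes(peptide):
    """Return every coding-strand oligonucleotide that encodes the peptide."""
    choices = [CODONS_FOR[aa] for aa in peptide.upper()]
    return ["".join(combo) for combo in product(*choices)]


# Hypothetical tripeptide chosen because Met and Trp each have a single codon.
probes = degenerate_probes("MWK")
```

The size of the mixture is the product of the codon counts at each position, which is why probe regions rich in amino acids with few codons are usually preferred.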

GDNFR proteins based on these nucleic acid sequences encoding GDNFR as 
well as mutant or variant sequences thereof, are also contemplated as within the scope 
of the present invention. Mutant or variant sequences include those sequences 
containing one or more nucleotide substitutions, deletions, and/or insertions as 
compared to the wild type sequence and that result in the expression of amino acid 
sequence variations as compared to the wild type amino acid sequence. In some cases 
naturally occurring GDNFR amino acid mutants or variants may exist, due to the 
existence of natural allelic variation. GDNFR proteins based on such naturally 
occurring mutants or variants are also within the scope of the present invention. 
Preparation of synthetic mutant sequences is also well known in the art, and is 
described for example in Wells et al. (Gene, 34:315, 1985) and in Sambrook et al., 
supra. 

In some cases, it may be desirable to prepare nucleic acid and/or amino acid 
variants of naturally occurring GDNFR. Nucleic acid variants (wherein one or more 
nucleotides are designed to differ from the wild-type or naturally occurring GDNFR) 
may be produced using site directed mutagenesis or PCR amplification where the 
primer(s) have the desired point mutations (see Sambrook et al., supra, and Ausubel et 
al., supra, for descriptions of mutagenesis techniques). Chemical synthesis using 
methods described by Engels et al., supra, may also be used to prepare such variants. 
Other methods known to the skilled artisan may be used as well. Preferred nucleic 
acid variants are those containing nucleotide substitutions accounting for codon 
preference in the host cell that is to be used to recombinantly produce GDNFR. Other 
preferred variants are those encoding conservative amino acid changes (e.g., wherein 
the charge or polarity of the naturally occurring amino acid side chain is not altered 
substantially by substitution with a different amino acid) as compared to wild type, 
and/or those designed to either generate a novel glycosylation and/or phosphorylation 
site(s) on GDNFR, or those designed to delete an existing glycosylation and/or 
phosphorylation site(s) on GDNFR. 
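
Nucleic acid variants that account for host codon preference leave the encoded amino acid sequence unchanged. The following is a minimal sketch of such recoding under the standard genetic code. The preference table is a hypothetical placeholder, not measured codon usage for any host, and the input sequence is an invented example rather than GDNFR sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Hypothetical host preference table (illustration only, not real usage data).
PREFERRED_CODONS = {"L": "CTG", "K": "AAA", "S": "AGC"}


def translate(dna):
    """Translate complete codons of a coding-strand DNA sequence."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def recode(dna, preferred):
    """Swap each codon for the host-preferred synonymous codon, where one is listed."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    recoded = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        replacement = preferred.get(amino_acid, codon)
        if CODON_TABLE[replacement] != amino_acid:
            raise ValueError(f"{replacement} does not encode {amino_acid}")
        recoded.append(replacement)
    return "".join(recoded)


original = "ATGTTAAAGTCT"  # encodes Met-Leu-Lys-Ser
recoded = recode(original, PREFERRED_CODONS)
```

The check inside recode guards against a preference table that would silently change the protein, which is the property a codon-preference variant is meant to preserve.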

Vectors 

The cDNA or genomic DNA encoding the desired GDNFR protein is inserted 
into a vector for further cloning (amplification of the DNA) or for expression. Suitable 
vectors are commercially available, or the vector may be specially constructed. 
Possible vectors include, but are not limited to, cosmids, plasmids or modified 
viruses, but the vector system must be compatible with the selected host cell. Such 
vectors include, but are not limited to, bacteriophages such as lambda derivatives, or 
plasmids such as pBR322, pUC, or Bluescript® plasmid derivatives (Stratagene, La 
Jolla, CA). The recombinant molecules can be introduced into host cells via 
transformation, transfection, infection, electroporation, or other known techniques. 

For example, the GDNFR-encoding nucleic acid sequence is inserted into a 
cloning vector which is used to transform, transfect, or infect appropriate host cells so 
that many copies of the nucleic acid sequence are generated. This can be accomplished 
by ligating a DNA fragment into a cloning vector which has complementary cohesive 
termini. If the complementary restriction sites used to fragment the DNA are not 
present in the cloning vector, the ends of the DNA molecules may be enzymatically 
modified. It also may prove advantageous to incorporate restriction endonuclease 
cleavage sites into the oligonucleotide primers used in polymerase chain reaction to 
facilitate insertion of the resulting nucleic acid sequence into vectors. Alternatively, 
any site desired may be produced by ligating nucleotide sequences (linkers) onto the 
DNA termini; these ligated linkers may comprise specific chemically synthesized 
oligonucleotides encoding restriction endonuclease recognition sequences. In an 
alternative method, the cleaved vector and GDNFR-encoding nucleic acid sequence 
may be modified by homopolymeric tailing. In specific embodiments, transformation 
of host cells with recombinant DNA molecules that incorporate an isolated GDNFR 
gene, cDNA, or synthesized DNA sequence enables generation of multiple copies of 
the gene. Thus, the GDNFR-encoding nucleic acid sequence may be obtained in large 
quantities by growing transformants, isolating the recombinant DNA molecules from 
the transformants and, when necessary, retrieving the inserted gene from the isolated 
recombinant DNA. 

The selection or construction of the appropriate vector will depend on 1) 
whether it is to be used for DNA amplification or for DNA expression, 2) the size of 
the DNA to be inserted into the vector, and 3) the host cell (e.g., mammalian, insect, 
yeast, fungal, plant or bacterial cells) to be transformed with the vector. Each vector 
contains various components depending on its function (amplification of DNA or 
expression of DNA) and its compatibility with the intended host cell. For DNA 
expression, the vector components may include, but are not limited to, one or more of 
the following: a signal sequence, an origin of replication, one or more selection or 
marker genes, enhancer elements, promoters, a transcription termination sequence, 
and the like. These components may be obtained from natural sources or synthesized 
by known procedures. The vectors of the present invention involve a nucleic acid 
sequence which encodes the GDNFR protein of interest operatively linked to one or 
more amplification, expression control, regulatory or similar operational elements 
capable of directing, controlling or otherwise effecting the amplification or expression 
of the GDNFR-encoding nucleic acid sequence in the selected host cell. 

Expression vectors containing GDNFR nucleic acid sequence inserts can be 
identified by three general approaches: (a) DNA-DNA hybridization; (b) the presence 
or absence of "marker" gene functions; and (c) the expression of inserted sequences. 
In the first approach, the presence of a foreign nucleic acid sequence inserted in an 
expression vector can be detected by DNA-DNA hybridization using probes 
comprising sequences that are homologous to an inserted GDNFR-encoding nucleic 
acid sequence. In the second approach, the recombinant vector/host system can be 
identified and selected based upon the presence or absence of certain "marker" gene 
functions (e.g., thymidine kinase activity, resistance to antibiotics, transformation 
phenotype, occlusion body formation in baculovirus, etc.) caused by the insertion of a 
foreign nucleic acid sequence into the vector. For example, if a GDNFR-encoding 
nucleic acid sequence is inserted within the marker gene sequence of the vector, 
recombinants containing the GDNFR insert can be identified by the absence of the 
marker gene function. In the third approach, recombinant expression vectors can be 
identified by detecting the foreign protein product expressed by the recombinant 
nucleic acid sequence. Such assays can be based on the physical or functional 
properties of the expressed GDNFR protein product, for example, by binding of the 
GDNFR protein to GDNF or to an antibody which directly recognizes GDNFR. 

Signal Sequence 

The signal sequence may be a component of the vector, or it may be a part of 
GDNFR DNA that is inserted into the vector. The native GDNFR DNA encodes a 
signal sequence at the amino terminus of the protein that is cleaved during 
post-translational processing of the protein to form the mature GDNFR protein. Included 
within the scope of this invention are GDNFR polynucleotides with the native signal 
sequence as well as GDNFR polynucleotides wherein the native signal sequence is 
deleted and replaced with a heterologous signal sequence. The heterologous signal 
sequence selected should be one that is recognized and processed, i.e., cleaved by a 
signal peptidase, by the host cell. For prokaryotic host cells that do not recognize and 
process the native GDNFR signal sequence, the signal sequence is substituted by a 
prokaryotic signal sequence selected, for example, from the group of the alkaline 
phosphatase, penicillinase, or heat-stable enterotoxin II leaders. For yeast secretion, 
the native GDNFR signal sequence may be substituted by the yeast invertase, alpha 
factor, or acid phosphatase leaders. In mammalian cell expression the native signal 
sequence is satisfactory, although other mammalian signal sequences may be suitable. 

Origin of Replication 

Expression and cloning vectors generally include a nucleic acid sequence that 
enables the vector to replicate in one or more selected host cells. In cloning vectors, 
this sequence is typically one that enables the vector to replicate independently of the 
host chromosomal DNA, and includes origins of replication or autonomously 
replicating sequences. Such sequences are well known for a variety of bacteria, 
yeasts, and viruses. The origin of replication from the plasmid pBR322 is suitable for 
most Gram-negative bacteria and various origins (e.g., SV40, polyoma, adenovirus, 
VSV or BPV) are useful for cloning vectors in mammalian cells. Generally, the origin 
of replication component is not needed for mammalian expression vectors (for 
example, the SV40 origin is often used only because it contains the early promoter). 

Selection Gene 

The expression and cloning vectors may contain a selection gene. This gene 
encodes a "marker" protein necessary for the survival or growth of the transformed 
host cells when grown in a selective culture medium. Host cells that were not 
transformed with the vector will not contain the selection gene, and therefore, they will 
not survive in the culture medium. Typical selection genes encode proteins that (a) 
confer resistance to antibiotics or other toxins, e.g., ampicillin, neomycin, 
methotrexate, or tetracycline; (b) complement auxotrophic deficiencies; or (c) supply 
critical nutrients not available from the culture medium. 

Other selection genes may be used to amplify the gene which will be 
expressed. Amplification is the process wherein genes which are in greater demand 
for the production of a protein critical for growth are reiterated in tandem within the 
chromosomes of successive generations of recombinant cells. Examples of suitable 
selectable markers for mammalian cells include dihydrofolate reductase (DHFR) and 
thymidine kinase. The mammalian cell transformants are placed under selection 
pressure which only the transformants are uniquely adapted to survive by virtue of the 
marker present in the vector. Selection pressure is imposed by culturing the 
transformed cells under conditions in which the concentration of selection agent in the 
medium is successively changed, thereby leading to amplification of both the selection 
gene and the DNA that encodes GDNFR. As a result, increased quantities of GDNFR 
are synthesized from the amplified DNA. 

For example, cells transformed with the DHFR selection gene are first 
identified by culturing all of the transformants in a culture medium that contains 
methotrexate, a competitive antagonist of DHFR. An appropriate host cell when 
wild-type DHFR is used is the Chinese hamster ovary cell line deficient in DHFR activity 
(see, for example, Urlaub and Chasin, Proc. Natl. Acad. Sci., U.S.A., 77(7): 4216- 
4220, 1980). The transformed cells are then exposed to increased levels of 
methotrexate. This leads to the synthesis of multiple copies of the DHFR gene, and, 
concomitantly, multiple copies of other DNA present in the expression vector, such as 
the DNA encoding a GDNFR protein. 

Promoter 

The expression and cloning vectors of the present invention will typically 
contain a promoter that is recognized by the host organism and operably linked to the 
nucleic acid sequence encoding the GDNFR protein. Promoters are untranslated 
sequences located upstream (5') to the start codon of a structural gene (generally 
within about 100 to 1000 bp) that control the transcription and translation of a 
particular nucleic acid sequence, such as that encoding GDNFR. Promoters are 
conventionally grouped into one of two classes, inducible promoters and constitutive 
promoters. Inducible promoters initiate increased levels of transcription from DNA 
under their control in response to some change in culture conditions, such as the 
presence or absence of a nutrient or a change in temperature. A large number of 
promoters, recognized by a variety of potential host cells, are well known. These 
promoters are operably linked to the DNA encoding GDNFR by removing the 
promoter from the source DNA by restriction enzyme digestion and inserting the 
desired promoter sequence into the vector. The native GDNFR promoter sequence 
may be used to direct amplification and/or expression of GDNFR DNA. A 
heterologous promoter is preferred, however, if it permits greater transcription and 
higher yields of the expressed protein as compared to the native promoter, and if it is 
compatible with the host cell system that has been selected for use. 

Promoters suitable for use with prokaryotic hosts include the beta-lactamase 
and lactose promoter systems; alkaline phosphatase; a tryptophan (trp) promoter 
system; and hybrid promoters such as the tac promoter. Other known bacterial 
promoters are also suitable. Their nucleotide sequences have been published, thereby 
enabling one skilled in the art to ligate them to the desired DNA sequence(s), using 
linkers or adaptors as needed to supply any required restriction sites. 

Suitable promoting sequences for use with yeast hosts are also well known in 
the art. Yeast enhancers are advantageously used with yeast promoters. Suitable 
promoters for use with mammalian host cells are well known and include those 
obtained from the genomes of viruses such as polyoma virus, fowlpox virus, 
adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, 
cytomegalovirus, a retrovirus, hepatitis-B virus and, most preferably, Simian Virus 40 
(SV40). Other suitable mammalian promoters include heterologous mammalian 
promoters, e.g., heat-shock promoters and the actin promoter. A promoter for 
possible use in the production of GDNFR proteins in CHO cells is SRα (see Takebe et 
al., Mol. Cell. Biol., 8(1):466-472, 1988). A suitable expression vector is pDSRα2. 
The pDSRα2 plasmid constructs containing the appropriate GDNFR cDNA may be 
prepared substantially in accordance with the process described in the co-owned and 
copending U.S. Patent Application Serial Number 501,904 filed March 29, 1990 
(also see European Patent Application No. 90305433, Publication No. EP 398 753, 
filed May 18, 1990 and WO 90/14363 (1990)), the disclosures of which are hereby 
incorporated by reference. 

Additional promoters which may be of interest in controlling GDNFR 
expression include, but are not limited to: the SV40 early promoter region (Benoist 
and Chambon, Nature, 290:304-310, 1981); the CMV promoter; the promoter 
contained in the 3' long terminal repeat of Rous sarcoma virus (Yamamoto, et al., 
Cell, 22:787-797, 1980); the herpes thymidine kinase promoter (Wagner et al., Proc. 
Natl. Acad. Sci. U.S.A., 78:1441-1445, 1981); the regulatory sequences of the 
metallothionein gene (Brinster et al., Nature, 296:39-42, 1982); prokaryotic 
expression vectors such as the beta-lactamase promoter (Villa-Komaroff, et al., Proc. 
Natl. Acad. Sci. U.S.A., 75:3727-3731, 1978); or the tac promoter (DeBoer, et al., 
Proc. Natl. Acad. Sci. U.S.A., 80:21-25, 1983). Also of interest are the following 
animal transcriptional control regions, which exhibit tissue specificity and have been 
utilized in transgenic animals: the elastase 1 gene control region which is active in 
pancreatic acinar cells (Swift et al., Cell, 38:639-646, 1984; Ornitz et al., Cold Spring 
Harbor Symp. Quant. Biol., 50:399-409, 1986; MacDonald, Hepatology, 7:425-515, 
1987); the insulin gene control region which is active in pancreatic beta cells 
(Hanahan, Nature, 315:115-122, 1985); the immunoglobulin gene control region 
which is active in lymphoid cells (Grosschedl et al., Cell, 38:647-658, 1984; Adames 
et al., Nature, 318:533-538, 1985; Alexander et al., Mol. Cell. Biol., 7:1436-1444, 
1987); the mouse mammary tumor virus control region which is active in testicular, 
breast, lymphoid and mast cells (Leder et al., Cell, 45:485-495, 1986); the albumin gene 
control region which is active in liver (Pinkert et al., Genes and Devel., 1:268-276, 
1987); the alpha-fetoprotein gene control region which is active in liver (Krumlauf et 
al., Mol. Cell. Biol., 5:1639-1648, 1985; Hammer et al., Science, 235:53-58, 1987); 
the alpha 1-antitrypsin gene control region which is active in the liver (Kelsey et al., 
Genes and Devel., 1:161-171, 1987); the beta-globin gene control region which is 
active in myeloid cells (Mogram et al., Nature, 315:338-340, 1985; Kollias et al., 
Cell, 46:89-94, 1986); the myelin basic protein gene control region which is active in 
oligodendrocyte cells in the brain (Readhead et al., Cell, 48:703-712, 1987); the 
myosin light chain-2 gene control region which is active in skeletal muscle (Sani, 
Nature, 314:283-286, 1985); and the gonadotropic releasing hormone gene control 
region which is active in the hypothalamus (Mason et al., Science, 234:1372-1378, 
1986). 


Enhancer Element 

An enhancer sequence may be inserted into the vector to increase the 
transcription of a DNA sequence encoding a GDNFR protein of the present invention 
by higher eukaryotes. Enhancers are cis-acting elements of DNA, usually about 
10-300 bp in length, that act on the promoter to increase its transcription. Enhancers 
are relatively orientation and position independent. They have been found 5' and 3' to 
the transcription unit. Several enhancer sequences available from mammalian genes 
are known (e.g., globin, elastase, albumin, alpha-feto-protein and insulin). Typically, 
however, an enhancer from a virus will be used. The SV40 enhancer, the 
cytomegalovirus early promoter enhancer, the polyoma enhancer, and adenovirus 
enhancers are exemplary enhancing elements for the activation of eukaryotic 
promoters. While an enhancer may be spliced into the vector at a position 5' or 3' to 
GDNFR DNA, it is typically located at a site 5' from the promoter. 






Transcription Termination 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, 
animal, human, or nucleated cells from other multicellular organisms) will also contain 
sequences necessary for terminating transcription and stabilizing the mRNA. Such 
sequences are commonly available from the 5' and occasionally 3' untranslated regions 
of eukaryotic DNAs or cDNAs. These regions contain nucleotide segments 
transcribed as polyadenylated fragments in the untranslated portion of the mRNA 
encoding GDNFR. 

The construction of suitable vectors containing one or more of the above-listed 
components together with the desired GDNFR-encoding sequence is accomplished by 
standard ligation techniques. Isolated plasmids or DNA fragments are cleaved, 
tailored, and religated in the desired order to generate the plasmids required. To 
confirm that the correct sequences have been constructed, the ligation mixtures may be 
used to transform E. coli, and successful transformants may be selected by known 
techniques, such as ampicillin or tetracycline resistance as described above. Plasmids 
from the transformants may then be prepared, analyzed by restriction endonuclease 
digestion, and/or sequenced to confirm the presence of the desired construct. 

Vectors that provide for the transient expression of DNA encoding GDNFR in 
mammalian cells may also be used. In general, transient expression involves the use 
of an expression vector that is able to replicate efficiently in a host cell, such that the 
host cell accumulates many copies of the expression vector and, in turn, synthesizes 
high levels of the desired protein encoded by the expression vector. Transient 
expression systems, comprising a suitable expression vector and a host cell, allow for 
the convenient positive identification of proteins encoded by cloned DNAs, as well as 
for the rapid screening of such proteins for desired biological or physiological 
properties. Thus, transient expression systems are particularly useful in identifying 
variants of the protein. 

Selection and Transformation of Host Cells 

Host cells (e.g., bacterial, mammalian, insect, yeast, or plant cells) 
transformed with nucleic acid sequences for use in expressing a recombinant GDNFR 
protein are also provided by the present invention. The transformed host cell is 
cultured under appropriate conditions permitting the expression of the nucleic acid 
sequence. The selection of suitable host cells and methods for transformation, culture, 
amplification, screening and product production and purification are well known in the 
art. See, for example, Gething and Sambrook, Nature, 293: 620-625 (1981), or
alternatively, Kaufman et al., Mol. Cell. Biol., 5 (7): 1750-1759 (1985) or Howley et
al., U.S. Pat. No. 4,419,446. Additional exemplary materials and methods are 
discussed herein. The transformed host cell is cultured in a suitable medium, and the 
expressed GDNFR protein is then optionally recovered, isolated and purified from the
culture medium (or from the cell, if expressed intracellularly) by an appropriate means 
known to those skilled in the art. 

Different host cells have characteristic and specific mechanisms for the
translational and post-translational processing and modification (e.g., glycosylation,
cleavage) of proteins. Appropriate cell lines or host systems can be chosen to ensure 
the desired modification and processing of the foreign protein expressed. For 
example, expression in a bacterial system can be used to produce an unglycosylated 
core protein product. Expression in yeast may be used to produce a glycosylated
product. Expression in mammalian cells can be used to ensure "native" glycosylation
of the heterologous GDNFR protein. Furthermore, different vector/host expression 
systems may effect processing reactions such as proteolytic cleavages to different 
extents. 

Suitable host cells for cloning or expressing the vectors disclosed herein are
prokaryote, yeast, or higher eukaryote cells. Eukaryotic microbes such as filamentous
fungi or yeast may be suitable hosts for the expression of GDNFR proteins.
Saccharomyces cerevisiae, or common baker's yeast, is the most commonly used 
among lower eukaryotic host microorganisms, but a number of other genera, species, 
and strains are well known and commonly available. 

Host cells to be used for the expression of glycosylated GDNFR protein are
also derived from multicellular organisms. Such host cells are capable of complex
processing and glycosylation activities. In principle, any higher eukaryotic cell culture 
might be used, whether such culture involves vertebrate or invertebrate cells, including 
plant and insect cells. The propagation of vertebrate cells in culture (tissue culture) is a
well-known procedure. Examples of useful mammalian host cell lines include, but are
not limited to, monkey kidney CV1 line transformed by SV40 (COS7), human
embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture), 
baby hamster kidney cells, and Chinese hamster ovary cells. Other suitable 
mammalian cell lines include, but are not limited to, HeLa, mouse L-929 cells,
3T3 lines derived from Swiss, Balb-c or NIH mice, and BHK or HaK hamster cell lines.

Suitable host cells also include prokaryotic cells. Prokaryotic host cells
include, but are not limited to, bacterial cells, such as Gram-negative or Gram-positive 
organisms, for example, E. coli, Bacilli such as B. subtilis, Pseudomonas species 
such as P. aeruginosa, Salmonella typhimurium, or Serratia marcescens. For
example, the various strains of E. coli (e.g., HB101, DH5α, DH10, and MC1061) are
well-known as host cells in the field of biotechnology. Various strains of 
Streptomyces spp. and the like may also be employed. Presently preferred host cells 
for producing GDNFR proteins are bacterial cells (e.g., Escherichia coli) and
mammalian cells (such as Chinese hamster ovary cells, COS cells, etc.).

The host cells are transfected and preferably transformed with the above-described
expression or cloning vectors and cultured in a conventional nutrient
medium. The medium may be modified as appropriate for inducing promoters,
selecting transformants, or amplifying the genes encoding the desired sequences. 
Transfection and transformation are performed using standard techniques which are 
well known to those skilled in the art and which are selected as appropriate to the host 
cell involved. For example, for mammalian cells without cell walls, the calcium
phosphate precipitation method may be used. Electroporation, microinjection and
other known techniques may also be used.

Culturing the Host Cells

Transformed cells used to produce GDNFR proteins of the present invention
are cultured in suitable media. The media may be supplemented as necessary with
hormones and/or other growth factors (such as insulin, transferrin, or epidermal 
growth factor), salts (such as sodium chloride, calcium, magnesium, and phosphate), 
buffers (such as HEPES), nucleosides (such as adenosine and thymidine), antibiotics 
(such as gentamicin), trace elements (defined as inorganic compounds usually present
at final concentrations in the micromolar range), and glucose or other energy source.
Other supplements may also be included, at appropriate concentrations, as will be 
appreciated by those skilled in the art. Suitable culture conditions, such as 
temperature, pH, and the like, are also well known to those skilled in the art for use 
with the selected host cells. 

Once the GDNFR protein is produced, it may be isolated and purified by
standard methods including chromatography (e.g., ion exchange, affinity, and sizing
column chromatography), centrifugation, differential solubility, or by any other 
standard technique for the purification of proteins. In particular, GDNFR protein may 
be isolated by binding to an affinity column comprising GDNF or anti-GDNFR
antibody bound to a stationary support.

Homologous Recombination 

It is further envisioned that GDNFR proteins may be produced by homologous 
recombination, or with recombinant production methods utilizing control elements
introduced into cells already containing DNA encoding GDNFR. For example,
homologous recombination methods may be used to modify a cell that contains a
normally transcriptionally silent GDNFR gene or underexpressed gene and thereby
produce a cell which expresses GDNFR. Homologous recombination is a technique
originally developed for targeting genes to induce or correct mutations in 
transcriptionally active genes (Kucherlapati, Prog. in Nucl. Acid Res. and Mol. Biol.,
36:301, 1989). The basic technique was developed as a method for introducing
specific mutations into specific regions of the mammalian genome (Thomas et al.,
Cell, 44:419-428, 1986; Thomas and Capecchi, Cell, 51:503-512, 1987; Doetschman
et al., Proc. Natl. Acad. Sci., 85:8583-8587, 1988) or to correct specific mutations
within defective genes (Doetschman et al., Nature, 330:576-578, 1987). Exemplary
homologous recombination techniques are described in U.S. 5,272,071
(EP 91 90 3051, EP Publication No. 505 500; PCT/US90/07642, International
Publication No. WO 91/09955) the disclosure of which is hereby incorporated by 
reference. 

Through homologous recombination, the DNA sequence to be inserted into the 
genome can be directed to a specific region of the gene of interest by attaching it to
targeting DNA. The targeting DNA is DNA that is complementary (homologous) to a
region of the genomic DNA. Small pieces of targeting DNA that are complementary to 
a specific region of the genome are put in contact with the parental strand during the 
DNA replication process. It is a general property of DNA that has been inserted into a 
cell to hybridize, and therefore, recombine with other pieces of endogenous DNA
through shared homologous regions. If this complementary strand is attached to an
oligonucleotide that contains a mutation or a different sequence of DNA, it too is 
incorporated into the newly synthesized strand as a result of the recombination. As a 
result of the proofreading function, it is possible for the new sequence of DNA to 
serve as the template. Thus, the transferred DNA is incorporated into the genome. 

If the sequence of a particular gene is known, such as the nucleic acid
sequence, the pre-pro sequence or expression control sequence of GDNFR presented
herein, a piece of DNA that is complementary to a selected region of the gene can be 
synthesized or otherwise obtained, such as by appropriate restriction of the native 
DNA at specific recognition sites bounding the region of interest. This piece serves as
a targeting sequence upon insertion into the cell and will hybridize to its homologous
region within the genome. If this hybridization occurs during DNA replication, this
piece of DNA, and any additional sequence attached thereto, will act as an Okazaki
fragment and will be backstitched into the newly synthesized daughter strand of DNA.

Attached to these pieces of targeting DNA are regions of DNA which may
interact with the expression of a GDNFR protein. For example, a promoter/enhancer
element, a suppressor, or an exogenous transcription modulatory element is inserted in
the genome of the intended host cell in proximity and orientation sufficient to influence 
the transcription of DNA encoding the desired GDNFR protein. The control element
does not encode GDNFR, but instead controls a portion of the DNA present in the 
host cell genome. Thus, the expression of GDNFR proteins may be achieved not by 
transfection of DNA that encodes the GDNFR gene itself, but rather by the use of 



coupled with DNA regulatory segments that provide the endogenous gene sequence 
with recognizable signals for transcription of a GDNFR protein. 

A. GDNFR variants 

As discussed above, the term "GDNFR analogs" as used herein includes
polypeptides in which amino acids have been deleted from ("deletion variants"), 
inserted into ("addition variants"), or substituted for ("substitution variants") residues 
within the amino acid sequence of naturally-occurring GDNFR polypeptides including 
those depicted in Figures 2 and 4 (SEQ. ID. NOs.:2 and 4). Such variants are 
prepared by introducing appropriate nucleotide changes into the DNA encoding the 
polypeptide or by in vitro chemical synthesis of the desired polypeptide. It will be 
appreciated by those skilled in the art that many combinations of deletions, insertions, 
and substitutions can be made to an amino acid sequence such as mature human 
GDNFR provided that the final molecule possesses GDNFR activity. 

Based upon the present description of GDNFR amino acid sequences, one can 
readily design and manufacture a variety of nucleic acid sequences suitable for use in 
the recombinant (e.g., microbial) expression of polypeptides having primary 
conformations which differ from those depicted in the Figures in terms of the identity 
or location of one or more residues. Mutagenesis techniques for the replacement, 
insertion or deletion of one or more selected amino acid residues encoded by the 
nucleic acid sequences depicted in Figures 2 and 4 are well known to one skilled in the 
art (e.g., U.S. Pat. No. 4,518,584, the disclosure of which is hereby incorporated by
reference). There are two principal variables in the construction of substitution
variants: the location of the mutation site and the nature of the mutation. In designing 
GDNFR substitution variants, the selection of the mutation site and nature of the 
mutation will depend on the GDNFR characteristic(s) to be modified. The sites for 
mutation can be modified individually or in series, e.g., by (1) substituting first with 
conservative amino acid modifications and then with more radical selections depending 
upon the results achieved, (2) deleting the target amino acid residue, or (3) inserting 
amino acid residues adjacent to the located site. Conservative changes in from 1 to 30 
contiguous amino acids are preferred. N-terminal and C-terminal deletion GDNFR 
protein variants may also be generated by proteolytic enzymes. 
For GDNFR deletion variants, deletions generally range from about 1 to 30
contiguous residues, more usually from about 1 to 10 contiguous residues, and 
typically from about 1 to 5 contiguous residues. N-terminal, C-terminal and internal 
intrasequence deletions are contemplated. Deletions may be introduced into regions of
the molecule which have low homology with non-human GDNFR to modify the
activity of GDNFR. Deletions in areas of substantial homology with non-human 
GDNFR sequences will be more likely to significantly modify GDNFR biological 
activity. The number of consecutive deletions typically will be selected so as to 
preserve the tertiary structure of the GDNFR protein product in the affected domain,
e.g., cysteine crosslinking. Non-limiting examples of deletion variants include
truncated GDNFR protein products lacking N-terminal or C-terminal amino acid
residues. For example, one may prepare a soluble receptor by elimination of the
peptide region involved in a glycosyl-phosphatidylinositol (GPI) anchorage of
GDNFR to the cytoplasmic membrane.

For GDNFR addition variants, amino acid sequence additions typically include
N- and/or C-terminal fusions or terminal additions ranging in length from one residue
to polypeptides containing a hundred or more residues, as well as internal or medial 
additions of single or multiple amino acid residues. Polypeptides of the invention may 
also include an initial methionine amino acid residue (at position -1 with respect to the
first amino acid residue of the desired polypeptide). Internal additions may range
generally from about 1 to 10 contiguous residues, more typically from about 1 to 5 
residues, and usually from about 1 to 3 amino acid residues. Examples of N-terminal 
addition variants include GDNFR with the inclusion of a heterologous N-terminal 
signal sequence to the N-terminus of GDNFR to facilitate the secretion of mature
GDNFR from recombinant host cells and thereby facilitate harvesting or
bioavailability. Such signal sequences generally will be obtained from, and thus be
homologous to, the intended host cell species. Additions may also include amino acid 
sequences derived from the sequence of other neurotrophic factors. For example, it is 
contemplated that a fusion protein of GDNF and GDNFR may be produced, with or
without a linking sequence, thereby forming a single molecule therapeutic entity.

GDNFR substitution variants have one or more amino acid residues of the 
GDNFR amino acid sequence removed and a different residue(s) inserted in its place. 
Such substitution variants include allelic variants, which are characterized by 
naturally-occurring nucleotide sequence changes in the species population that may or
may not result in an amino acid change. As with the other variant forms, substitution
variants may involve the replacement of single or contiguous amino acid residues at 
one or more different locations. 
Specific mutations of the GDNFR amino acid sequence may involve
modifications to a glycosylation site (e.g., serine, threonine, or asparagine). The 
absence of glycosylation or only partial glycosylation results from amino acid 
substitution or deletion at any asparagine-linked glycosylation recognition site or at any
site of the molecule that is modified by addition of an O-linked carbohydrate. An
asparagine-linked glycosylation recognition site comprises a tripeptide sequence which
is specifically recognized by appropriate cellular glycosylation enzymes. These 
tripeptide sequences are either Asn-Xaa-Thr or Asn-Xaa-Ser, where Xaa can be any 
amino acid other than Pro. A variety of amino acid substitutions or deletions at one or
both of the first or third amino acid positions of a glycosylation recognition site (and/or
amino acid deletion at the second position) result in non-glycosylation at the modified 
tripeptide sequence. Thus, the expression of appropriate altered nucleotide sequences 
produces variants which are not glycosylated at that site. Alternatively, the GDNFR 
amino acid sequence may be modified to add glycosylation sites. 
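The tripeptide rule stated above (Asn-Xaa-Thr or Asn-Xaa-Ser, where Xaa is not Pro) lends itself to a simple sequence scan. The following is a minimal sketch, assuming one-letter amino acid codes; the function names and the toy peptide are hypothetical and are not taken from the GDNFR sequences of the Figures. The default replacement of Asn by Gln is merely one illustrative substitution at the first position of the site.

```python
import re

# Asn-Xaa-Ser/Thr with Xaa != Pro. A lookahead is used so that
# overlapping recognition sites are all reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(protein: str) -> list[int]:
    """Return 1-based positions of Asn residues that begin an N-X-S/T site."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


def abolish_sequon(protein: str, asn_pos: int, replacement: str = "Q") -> str:
    """Substitute the Asn at 1-based position `asn_pos` to remove the site."""
    i = asn_pos - 1
    if protein[i].upper() != "N":
        raise ValueError(f"position {asn_pos} is {protein[i]!r}, not Asn")
    return protein[:i] + replacement + protein[i + 1:]


# Hypothetical toy peptide: N3 and N11/N12 begin sites; N7 is followed by Pro.
toy = "MKNATLNPSGNNSTQ"
print(find_sequons(toy))                     # [3, 11, 12]
print(find_sequons(abolish_sequon(toy, 3)))  # [11, 12]
```

A scan of this kind only identifies candidate recognition sites; whether a given site is actually glycosylated depends on the host cell and must be determined experimentally.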

One method for identifying GDNFR amino acid residues or regions for
mutagenesis is called "alanine scanning mutagenesis" as described by Cunningham
and Wells (Science, 244: 1081-1085, 1989). In this method, an amino acid residue or 
group of target residues are identified (e.g., charged residues such as Arg, Asp, His, 
Lys, and Glu) and replaced by a neutral or negatively charged amino acid (most
preferably alanine or polyalanine) to affect the interaction of the amino acids with the
surrounding aqueous environment in or outside the cell. Those domains 
demonstrating functional sensitivity to the substitutions may then be refined by 
introducing additional or alternate residues at the sites of substitution. Thus, the target 
site for introducing an amino acid sequence variation is determined, alanine scanning
or random mutagenesis is conducted on the corresponding target codon or region of
the DNA sequence, and the expressed GDNFR variants are screened for the optimal
combination of desired activity and degree of activity. 
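By way of illustration only, the enumeration of single-residue alanine variants at charged positions may be sketched as follows. The function name, the variant naming convention (original residue, position, Ala) and the toy peptide are assumptions made for this example; the sketch covers only the in silico design step, not the expression or screening of the variants.

```python
# Charged residues named in the text: Arg, Asp, His, Lys, and Glu.
CHARGED = frozenset("RDHKE")


def alanine_scan(protein: str, targets=CHARGED) -> list[tuple[str, str]]:
    """Return (variant_name, variant_sequence) for each single Ala replacement."""
    protein = protein.upper()
    variants = []
    for i, aa in enumerate(protein):
        if aa in targets:
            name = f"{aa}{i + 1}A"  # e.g. K2A: Lys at position 2 replaced by Ala
            variants.append((name, protein[:i] + "A" + protein[i + 1:]))
    return variants


# Hypothetical five-residue toy peptide.
for name, seq in alanine_scan("MKDLE"):
    print(name, seq)
# K2A MADLE
# D3A MKALE
# E5A MKDLA
```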

The sites of greatest interest for substitutional mutagenesis include sites where 
the amino acids found in GDNFR proteins from various species are substantially
different in terms of side-chain bulk, charge, and/or hydrophobicity. Other sites of
interest are those in which particular residues of GDNFR-like proteins, obtained from 
various species, are identical. Such positions are generally important for the biological 
activity of a protein. Initially, these sites are substituted in a relatively conservative 
manner. Such conservative substitutions are shown in Table 2 under the heading of
preferred substitutions. If such substitutions result in a change in biological activity,
then more substantial changes (exemplary substitutions) may be introduced, and/or 
other additions or deletions may be made, and the resulting products are screened for 
TABLE 2
Amino Acid Substitutions

Original Residue    Preferred Substitutions    Exemplary Substitutions

Ala (A)             Val                        Val; Leu; Ile
Arg (R)             Lys                        Lys; Gln; Asn
Asn (N)             Gln                        Gln; His; Lys; Arg
Asp (D)             Glu                        Glu
Cys (C)             Ser                        Ser
Gln (Q)             Asn                        Asn
Glu (E)             Asp                        Asp
Gly (G)             Pro                        Pro
His (H)             Arg                        Asn; Gln; Lys; Arg
Ile (I)             Leu                        Leu; Val; Met; Ala; Phe; norleucine
Leu (L)             Ile                        norleucine; Ile; Val; Met; Ala; Phe
Lys (K)             Arg                        Arg; Gln; Asn
Met (M)             Leu                        Leu; Phe; Ile
Phe (F)             Leu                        Leu; Val; Ile; Ala
Pro (P)             Gly                        Gly
Ser (S)             Thr                        Thr
Thr (T)             Ser                        Ser
Trp (W)             Tyr                        Tyr
Tyr (Y)             Phe                        Trp; Phe; Thr; Ser
Val (V)             Leu                        Ile; Leu; Met; Phe; Ala; norleucine



Conservative modifications to the amino acid sequence (and the corresponding 
modifications to the encoding nucleic acid sequences) are expected to produce GDNFR 
protein products having functional and chemical characteristics similar to those of 
naturally occurring GDNFR. In contrast, substantial modifications in the functional
and/or chemical characteristics of GDNFR protein products may be accomplished by
selecting substitutions that differ significantly in their effect on maintaining (a) the
structure of the polypeptide backbone in the area of the substitution, for example, as a
sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the 
target site, or (c) the bulk of the side chain. Naturally occurring residues may be 
divided into groups based on common side chain properties: 
1) hydrophobic: norleucine, Met, Ala, Val, Leu, Ile;
2) neutral hydrophilic: Cys, Ser, Thr;
3) acidic: Asp, Glu;
4) basic: Asn, Gln, His, Lys, Arg;
5) residues that influence chain orientation: Gly, Pro; and
6) aromatic: Trp, Tyr, Phe.

Non-conservative substitutions may involve the exchange of a member of one 
of these classes for a member from another class. Such substituted residues may be 
introduced into regions of the human GDNFR protein that are homologous with
non-human GDNFR proteins, or into the non-homologous regions of the molecule.
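The six side-chain groups listed above can be used to label a proposed substitution as falling within one class or crossing between classes. The following is a minimal sketch under that assumption; the dictionary and function names are hypothetical, "Nle" is used here as an abbreviation for norleucine, and the grouping (including the placement of Asn and Gln in the basic group) simply follows the list as given. It does not reproduce the residue-by-residue preferences of Table 2.

```python
# Side-chain classes as enumerated in the text (three-letter codes).
SIDE_CHAIN_CLASSES = {
    "hydrophobic": {"Nle", "Met", "Ala", "Val", "Leu", "Ile"},
    "neutral hydrophilic": {"Cys", "Ser", "Thr"},
    "acidic": {"Asp", "Glu"},
    "basic": {"Asn", "Gln", "His", "Lys", "Arg"},
    "chain orientation": {"Gly", "Pro"},
    "aromatic": {"Trp", "Tyr", "Phe"},
}

# Reverse lookup: residue -> class name.
CLASS_OF = {
    residue: name
    for name, members in SIDE_CHAIN_CLASSES.items()
    for residue in members
}


def is_same_class(original: str, substitute: str) -> bool:
    """True if both residues belong to the same side-chain class."""
    return CLASS_OF[original] == CLASS_OF[substitute]


print(is_same_class("Leu", "Ile"))  # True  (both hydrophobic)
print(is_same_class("Asp", "Lys"))  # False (acidic vs. basic)
```

An exchange for which `is_same_class` returns False corresponds to the non-conservative substitutions described above, that is, the exchange of a member of one class for a member of another class.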

Thus, GDNFR proteins, analogs, or derivatives thereof include, but are not
limited to, those biologically active molecules containing, as a primary amino acid
sequence, all or part of the amino acid sequences as depicted in Figures 2 and 4 (SEQ 
ID NOs. 2 and 4). The proteins will include altered sequences in which biologically 
equivalent amino acid residues are substituted for residues within the sequence
resulting in a silent change. For example, one or more amino acid residues within the
sequence can be substituted by another amino acid of a similar polarity which acts as a 
functional equivalent, resulting in a silent alteration. Substitutes for an amino acid 
within the sequence may be selected from other members of the class to which the 
amino acid belongs. For example, the nonpolar (hydrophobic) amino acids include
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan and
methionine. The polar neutral amino acids include glycine, serine, threonine, cysteine,
tyrosine, asparagine, and glutamine. The positively charged (basic) amino acids 
include arginine, lysine and histidine. The negatively charged (acidic) amino acids 
include aspartic acid and glutamic acid. It is also contemplated that the GDNFR
proteins, analogs, or fragments or derivatives thereof may be differentially modified
during or after translation, e.g., by phosphorylation, glycosylation, crosslinking, 
acylation, proteolytic cleavage, linkage to an antibody molecule, membrane molecule 
or other ligand. 






B. GDNFR Derivatives 

Chemically modified derivatives of GDNFR or GDNFR analogs may be 
prepared by one of skill in the art based upon the present disclosure. The chemical
moieties most suitable for derivatization include water soluble polymers. A water
soluble polymer is desirable because the protein to which it is attached does not 
precipitate in an aqueous environment, such as a physiological environment. 
Preferably, the polymer will be pharmaceutically acceptable for the preparation of a
therapeutic product or composition. One skilled in the art will be able to select the
desired polymer based on such considerations as whether the polymer/protein 
conjugate will be used therapeutically, and if so, the desired dosage, circulation time, 
resistance to proteolysis, and other considerations. The effectiveness of the 
derivatization may be ascertained by administering the derivative, in the desired form
(e.g., by osmotic pump, or, more preferably, by injection or infusion, or, further
formulated for oral, pulmonary or other delivery routes), and determining its 
effectiveness. 

Suitable water soluble polymers include, but are not limited to, polyethylene 
glycol, copolymers of ethylene glycol/propylene glycol, carboxymethylcellulose,
dextran, polyvinyl alcohol, polyvinyl pyrrolidone, poly-1,3-dioxolane, poly-1,3,6-trioxane,
ethylene/maleic anhydride copolymer, polyaminoacids (either homopolymers
or random copolymers), and dextran or poly(n-vinyl pyrrolidone)polyethylene glycol,
polypropylene glycol homopolymers, polypropylene oxide/ethylene oxide copolymers,
polyoxyethylated polyols (e.g., glycerol), polyvinyl alcohol, and mixtures
thereof. Polyethylene glycol propionaldehyde may have advantages in manufacturing
due to its stability in water. 

The polymer may be of any molecular weight, and may be branched or 
unbranched. For polyethylene glycol, the preferred molecular weight is between about 
2 kDa and about 100 kDa for ease in handling and manufacturing (the term "about"
indicating that in preparations of polyethylene glycol, some molecules will weigh
more, some less, than the stated molecular weight). Other sizes may be used, 
depending on the desired therapeutic profile (e.g., the duration of sustained release 
desired; the effects, if any, on biological activity; the ease in handling; the degree or 
lack of antigenicity and other known effects of polyethylene glycol on a therapeutic
protein or variant).

The number of polymer molecules so attached may vary, and one skilled in the 
art will be able to ascertain the effect on function. One may mono-derivatize, or may 
provide for a di-, tri-, tetra- or some combination of derivatization, with the same or 
different chemical moieties (e.g., polymers, such as different weights of polyethylene
glycols). The proportion of polymer molecules to protein (or peptide) molecules will
vary, as will their concentrations in the reaction mixture. In general, the optimum ratio 
(in terms of efficiency of reaction in that there is no excess unreacted protein or
polymer) will be determined by factors such as the desired degree of derivatization
(e.g., mono, di-, tri-, etc.), the molecular weight of the polymer selected, whether the 
polymer is branched or unbranched, and the reaction conditions. 

The polyethylene glycol molecules (or other chemical moieties) should be
attached to the protein with consideration of effects on functional or antigenic domains
of the protein. There are a number of attachment methods available to those skilled in 
the art. See, for example, EP 0 401 384, the disclosure of which is hereby
incorporated by reference (coupling PEG to G-CSF); see also Malik et al., Exp.
Hematol., 20: 1028-1035, 1992 (reporting pegylation of GM-CSF using tresyl
chloride). For example, polyethylene glycol may be covalently bound through amino
acid residues via a reactive group, such as a free amino or carboxyl group. Reactive
groups are those to which an activated polyethylene glycol molecule may be bound. 
The amino acid residues having a free amino group may include lysine residues and 
the N-terminal amino acid residue. Those having a free carboxyl group may include
aspartic acid residues, glutamic acid residues, and the C-terminal amino acid residue.
Sulfhydryl groups may also be used as a reactive group for attaching the polyethylene
glycol molecule(s). For therapeutic purposes, attachment at an amino group, such as 
attachment at the N-terminus or lysine group is preferred. Attachment at residues 
important for receptor binding should be avoided if receptor binding is desired. 

One may specifically desire an N-terminal chemically modified protein. Using
polyethylene glycol as an illustration of the present compositions, one may select from
a variety of polyethylene glycol molecules (by molecular weight, branching, etc.), the 
proportion of polyethylene glycol molecules to protein (or peptide) molecules in the 
reaction mix, the type of pegylation reaction to be performed, and the method of
obtaining the selected N-terminally pegylated protein. The method of obtaining the
N-terminally pegylated preparation (i.e., separating this moiety from other 
monopegylated moieties if necessary) may be by purification of the N-terminally 
pegylated material from a population of pegylated protein molecules. Selective
N-terminal chemical modification may be accomplished by reductive alkylation which
exploits differential reactivity of different types of primary amino groups (lysine
versus the N-terminal) available for derivatization in a particular protein. Under the 
appropriate reaction conditions, substantially selective derivatization of the protein at 
the N-terminus with a carbonyl group containing polymer is achieved. For example, 
one may selectively N-terminally pegylate the protein by performing the reaction at a
pH which allows one to take advantage of the pKa differences between the ε-amino
group of the lysine residues and that of the α-amino group of the N-terminal residue of
the protein. By such selective derivatization, attachment of a water soluble polymer to 
a protein is controlled: the conjugation with the polymer takes place predominantly at
the N-terminus of the protein and no significant modification of other reactive groups, 
such as the lysine side chain amino groups, occurs. Using reductive alkylation, the 
water soluble polymer may be of the type described above, and should have a single
reactive aldehyde for coupling to the protein. Polyethylene glycol propionaldehyde,
containing a single reactive aldehyde, may be used. 
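The pKa-based selectivity described above can be made concrete with the Henderson-Hasselbalch relationship, since only the unprotonated form of an amine is nucleophilic. The following is a rough sketch only. The pKa values used (about 8.0 for an N-terminal α-amino group and about 10.5 for a lysine ε-amino group) are assumed, typical textbook values and are not taken from the present disclosure; actual values vary from protein to protein, and the calculation ignores the number of lysine residues, steric accessibility, and reaction kinetics.

```python
def fraction_unprotonated(pka: float, ph: float) -> float:
    """Henderson-Hasselbalch: fraction of an amine present as the free base."""
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


def n_terminal_selectivity(
    ph: float,
    pka_alpha: float = 8.0,     # assumed pKa of the N-terminal alpha-amino group
    pka_epsilon: float = 10.5,  # assumed pKa of a lysine epsilon-amino group
) -> float:
    """Ratio of reactive (unprotonated) alpha-amine to epsilon-amine, per group."""
    return fraction_unprotonated(pka_alpha, ph) / fraction_unprotonated(pka_epsilon, ph)


for ph in (5.0, 7.0, 9.0):
    print(f"pH {ph}: per-group selectivity ~ {n_terminal_selectivity(ph):.0f}x")
```

Under these assumed pKa values, the per-group preference for the α-amino group is a few hundred-fold at mildly acidic pH and falls to a few tens-fold at pH 9, which is consistent with performing the reductive alkylation at a pH chosen to favor the N-terminus.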

The present invention contemplates use of derivatives which are 
prokaryote-expressed GDNFR, or variants thereof, linked to at least one polyethylene 
glycol molecule, as well as use of GDNFR, or variants thereof, attached to one or
more polyethylene glycol molecules via an acyl or alkyl linkage.

Pegylation may be carried out by any of the pegylation reactions known in the 
art. See, for example: Focus on Growth Factors, 3 (2): 4-10, 1992; EP 0 154 316, 
the disclosure of which is hereby incorporated by reference; EP 0 401 384; and the 
other publications cited herein that relate to pegylation. The pegylation may be carried
out via an acylation reaction or an alkylation reaction with a reactive polyethylene
glycol molecule (or an analogous reactive water-soluble polymer). 

Pegylation by acylation generally involves reacting an active ester derivative of 
polyethylene glycol (PEG) with the GDNFR protein or variant. Any known or 
subsequently discovered reactive PEG molecule may be used to carry out the
pegylation of GDNFR protein or variant. A preferred activated PEG ester is PEG
esterified to N-hydroxysuccinimide (NHS). As used herein, "acylation" is 
contemplated to include without limitation the following types of linkages between the 
therapeutic protein and a water soluble polymer such as PEG: amide, carbamate, 
urethane, and the like. See Bioconjugate Chem., 5: 133-140, 1994. Reaction
conditions may be selected from any of those known in the pegylation art or those
subsequently developed, but should avoid conditions such as temperature, solvent, 
and pH that would inactivate the GDNFR or variant to be modified. 

Pegylation by acylation will generally result in a poly-pegylated GDNFR 
protein or variant. Preferably, the connecting linkage will be an amide. Also
preferably, the resulting product will be substantially only (e.g., > 95%) mono-, di- or
tri-pegylated. However, some species with higher degrees of pegylation may be
formed in amounts depending on the specific reaction conditions used. If desired, 
more purified pegylated species may be separated from the mixture, particularly 
unreacted species, by standard purification techniques, including, among others,
dialysis, salting-out, ultrafiltration, ion-exchange chromatography, gel filtration
chromatography and electrophoresis. 

Pegylation by alkylation generally involves reacting a terminal aldehyde 
derivative of PEG with the GDNFR protein or variant in the presence of a reducing
agent. Pegylation by alkylation can also result in poly-pegylated GDNFR protein or 
variant. In addition, one can manipulate the reaction conditions to favor pegylation 
substantially only at the α-amino group of the N-terminus of the GDNFR protein or
variant (i.e., a mono-pegylated protein). In either case of monopegylation or
polypegylation, the PEG groups are preferably attached to the protein via a -CH2-NH-
group. With particular reference to the -CH2- group, this type of linkage is referred to
herein as an "alkyl" linkage.

Derivatization via reductive alkylation to produce a monopegylated product
exploits differential reactivity of different types of primary amino groups (lysine
versus the N-terminal) available for derivatization. The reaction is performed at a pH
which allows one to take advantage of the pKa differences between the ε-amino
groups of the lysine residues and that of the α-amino group of the N-terminal residue
of the protein. By such selective derivatization, attachment of a water soluble polymer 

15 that contains a reactive group such as an aldehyde, to a protein is controlled: the 
conjugation with the polymer takes place predominantly at the N-terminus of the 
protein and no significant modification of other reactive groups, such as the lysine side 
chain amino groups, occurs. In one important aspect, the present invention 
contemplates use of a substantially homogeneous preparation of 

20 monopolymer/GDNFR protein (or variant) conjugate molecules (meaning GDNF 
protein or variant to which a polymer molecule has been attached substantially only 
(i.e., > 95%) in a single location). More specifically, if polyethylene glycol is used, 
the present invention also encompasses use of pegylated GDNFR protein or variant 
lacking possibly antigenic linking groups, and having the polyethylene glycol molecule 

25 directly coupled to the GDNFR protein or variant. 

Thus, GDNFR protein products according to the present invention include 
pegylated GDNFR protein or variants, wherein the PEG group(s) is (are) attached via 
acyl or alkyl groups. As discussed above, such products may be mono-pegylated or 
poly-pegylated (e.g., containing 2-6, and preferably 2-5, PEG groups). The PEG
groups are generally attached to the protein at the α- or ε-amino groups of amino acids,
but it is also contemplated that the PEG groups could be attached to any amino group 
attached to the protein, which is sufficiently reactive to become attached to a PEG 
group under suitable reaction conditions. 

The polymer molecules used in both the acylation and alkylation approaches
may be selected from among water soluble polymers as described above. The polymer
selected should be modified to have a single reactive group, such as an active ester for
acylation or an aldehyde for alkylation, preferably, so that the degree of polymerization
may be controlled as provided for in the present methods. An exemplary reactive PEG
aldehyde is polyethylene glycol propionaldehyde, which is water stable, or mono
C1-C10 alkoxy or aryloxy derivatives thereof (see, U.S. Patent 5,252,714). The
polymer may be branched or unbranched. For the acylation reactions, the polymer(s)
selected should have a single reactive ester group. For the present reductive
alkylation, the polymer(s) selected should have a single reactive aldehyde group.
Generally, the water soluble polymer will not be selected from naturally-occurring
glycosyl residues since these are usually made more conveniently by mammalian
recombinant expression systems. The polymer may be of any molecular weight, and
may be branched or unbranched.

An exemplary water-soluble polymer for use herein is polyethylene glycol. As 
used herein, polyethylene glycol is meant to encompass any of the forms of PEG that 
have been used to derivatize other proteins, such as mono-(C1-C10) alkoxy- or
aryloxy-polyethylene glycol.

In general, chemical derivatization may be performed under any suitable
condition used to react a biologically active substance with an activated polymer
molecule. Methods for preparing a pegylated GDNFR protein product will generally
comprise the steps of (a) reacting a GDNFR protein product with polyethylene glycol
(such as a reactive ester or aldehyde derivative of PEG) under conditions whereby the
protein becomes attached to one or more PEG groups, and (b) obtaining the reaction
product(s). In general, the optimal reaction conditions for the acylation reactions will
be determined case-by-case based on known parameters and the desired result. For
example, the larger the ratio of PEG:protein, the greater the percentage of
poly-pegylated product.

Reductive alkylation to produce a substantially homogeneous population of
mono-polymer/GDNFR protein product will generally comprise the steps of:
(a) reacting a GDNFR protein or variant with a reactive PEG molecule under reductive
alkylation conditions, at a pH suitable to permit selective modification of the α-amino
group at the amino terminus of said GDNFR protein or variant; and (b) obtaining the
reaction product(s).

For a substantially homogeneous population of mono-polymer/GDNFR 
protein product, the reductive alkylation reaction conditions are those which permit the 
selective attachment of the water soluble polymer moiety to the N-terminus of GDNFR 
protein or variant. Such reaction conditions generally provide for pKa differences 
between the lysine amino groups and the α-amino group at the N-terminus (the pKa
being the pH at which 50% of the amino groups are protonated and 50% are not). The
pH also affects the ratio of polymer to protein to be used. In general, if the pH is
lower, a larger excess of polymer to protein will be desired (i.e., the less reactive the
N-terminal α-amino group, the more polymer needed to achieve optimal conditions).
If the pH is higher, the polymer:protein ratio need not be as large (i.e., more reactive
groups are available, so fewer polymer molecules are needed). For purposes of the
present invention, the pH will generally fall within the range of 3-9, preferably 3-6.
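
By way of illustration only, the pH dependence described above may be sketched with the Henderson-Hasselbalch relation. The pKa values used below (about 7.8 for an N-terminal α-amino group and about 10.5 for a lysine ε-amino group) are typical textbook figures assumed for this sketch and are not stated in this specification; the function name `unprotonated_fraction` is likewise hypothetical.

```python
def unprotonated_fraction(ph: float, pka: float) -> float:
    """Fraction of an amino group present in the unprotonated (reactive) form.

    Henderson-Hasselbalch: fraction = 1 / (1 + 10**(pKa - pH)).
    At pH == pKa the fraction is 0.5, matching the definition of pKa in the text.
    """
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


# Assumed, typical textbook pKa values (NOT taken from this specification).
PKA_N_TERMINAL_ALPHA = 7.8
PKA_LYSINE_EPSILON = 10.5

for ph in (3.0, 5.0, 6.0, 9.0):
    alpha = unprotonated_fraction(ph, PKA_N_TERMINAL_ALPHA)
    epsilon = unprotonated_fraction(ph, PKA_LYSINE_EPSILON)
    # A larger alpha/epsilon ratio means better selectivity for the N-terminus.
    print(f"pH {ph:.1f}: alpha={alpha:.2e} epsilon={epsilon:.2e} ratio={alpha / epsilon:.0f}")
```

Under these assumed pKa values, at acidic pH the α-amino group is roughly 500-fold more likely to be unprotonated than a lysine ε-amino group, although only a small fraction of either group is reactive, which is consistent with the larger polymer excess described for lower pH.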

Another important consideration is the molecular weight of the polymer. In 
general, the higher the molecular weight of the polymer, the fewer polymer molecules 
may be attached to the protein. Similarly, branching of the polymer should be taken 
into account when optimizing these parameters. Generally, the higher the molecular
weight (or the more branches) the higher the polymer:protein ratio. In general, for the
pegylation reactions contemplated herein, the preferred average molecular weight is
about 2 kDa to about 100 kDa. More preferably, the average molecular weight is about
5 kDa to about 50 kDa, particularly preferably about 12 kDa to about 25 kDa. The ratio of
water-soluble polymer to GDNF protein or variant will generally range from 1:1 to
100:1, preferably (for polypegylation) 1:1 to 20:1 and (for monopegylation) 1:1 to 5:1.
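
As a hedged illustration of how a polymer:protein molar ratio translates into reagent quantities, the following sketch converts a chosen ratio into a mass of PEG. The function name `peg_mass_mg` and the 50 kDa protein molecular weight in the usage line are hypothetical placeholders chosen for the arithmetic and are not values taken from this specification.

```python
def peg_mass_mg(protein_mass_mg: float, protein_mw_kda: float,
                peg_mw_kda: float, molar_ratio: float) -> float:
    """Mass of PEG (mg) that supplies `molar_ratio` moles of PEG per mole of protein.

    1 kDa corresponds to 1 mg per micromole, so mass (mg) / MW (kDa) = micromoles.
    """
    if min(protein_mass_mg, protein_mw_kda, peg_mw_kda, molar_ratio) <= 0:
        raise ValueError("all inputs must be positive")
    protein_umol = protein_mass_mg / protein_mw_kda
    peg_umol = protein_umol * molar_ratio
    return peg_umol * peg_mw_kda


# Hypothetical example: 10 mg of a 50 kDa protein, 20 kDa PEG, 5:1 molar ratio.
print(peg_mass_mg(10.0, 50.0, 20.0, 5.0))  # 0.2 umol protein x 5 x 20 mg/umol = 20.0 mg
```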
Using the conditions indicated above, reductive alkylation will provide for 
selective attachment of the polymer to any GDNFR protein or variant having an 
a-amino group at the amino terminus, and provide for a substantially homogenous 
preparation of monopolymer/GDNFR protein (or variant) conjugate. The term
"monopolymer/GDNFR protein (or variant) conjugate" is used here to mean a
composition comprised of a single polymer molecule attached to a molecule of 
GDNFR protein or GDNFR variant protein. The monopolymer/GDNFR protein (or 
variant) conjugate typically will have a polymer molecule located at the N-terminus, 
but not on lysine amino side groups. The preparation will generally be greater than
90% monopolymer/GDNFR protein (or variant) conjugate, and more usually greater
than 95% monopolymer/GDNFR protein (or variant) conjugate, with the remainder of 
observable molecules being unreacted (i.e., protein lacking the polymer moiety). It is 
also envisioned that the GDNFR protein product may involve the preparation of a 
pegylated molecule involving a fusion protein or linked GDNFR and GDNF
molecules.

For the present reductive alkylation, the reducing agent should be stable in 
aqueous solution and preferably be able to reduce only the Schiff base formed in the
initial process of reductive alkylation. Suitable reducing agents may be selected from
sodium borohydride, sodium cyanoborohydride, dimethylamine borane,
trimethylamine borane and pyridine borane. A particularly suitable reducing agent is
sodium cyanoborohydride. Other reaction parameters, such as solvent, reaction times,
temperatures, etc., and means of purification of products, can be determined
case-by-case based on the published information relating to derivatization of proteins
with water soluble polymers (see the publications cited herein).

C. GDNFR Protein Product Pharmaceutical Compositions

GDNFR protein product pharmaceutical compositions typically include a
therapeutically or prophylactically effective amount of GDNFR protein product in
admixture with one or more pharmaceutically and physiologically acceptable
formulation materials selected for suitability with the mode of administration. Suitable
formulation materials include, but are not limited to, antioxidants, preservatives,
coloring, flavoring and diluting agents, emulsifying agents, suspending agents,
solvents, fillers, bulking agents, buffers, delivery vehicles, diluents, excipients and/or
pharmaceutical adjuvants. For example, a suitable vehicle may be water for injection,
physiological saline solution, or artificial cerebrospinal fluid, possibly supplemented
with other materials common in compositions for parenteral administration. Neutral
buffered saline or saline mixed with serum albumin are further exemplary vehicles.
The term "pharmaceutically acceptable carrier" or "physiologically acceptable carrier"
as used herein refers to a formulation material(s) suitable for accomplishing or
enhancing the delivery of the GDNFR protein product as a pharmaceutical
composition.

The primary solvent in a vehicle may be either aqueous or non-aqueous in
nature. In addition, the vehicle may contain other formulation materials for modifying
or maintaining the pH, osmolality, viscosity, clarity, color, sterility, stability, rate of
dissolution, or odor of the formulation. Similarly, the vehicle may contain additional
formulation materials for modifying or maintaining the rate of release of GDNFR
protein product, or for promoting the absorption or penetration of GDNFR protein
product across the blood-brain barrier.

Once the therapeutic pharmaceutical composition has been formulated, it may 
be stored in sterile vials as a solution, suspension, gel, emulsion, solid, or dehydrated 
or lyophilized powder. Such formulations may be stored either in a ready to use form
or in a form (e.g., lyophilized) requiring reconstitution prior to administration.

The optimal pharmaceutical formulation will be determined by one skilled in 
the art depending upon the intended route of administration and desired dosage. See 
for example, Remington's Pharmaceutical Sciences, 18th Ed. (1990, Mack Publishing 
Co., Easton, PA 18042) pages 1435-1712, the disclosure of which is hereby
incorporated by reference. Such compositions may influence the physical state,
stability, rate of in vivo release, and rate of in vivo clearance of the present proteins 
and derivatives.

Effective administration forms, such as (1) slow-release formulations, (2)
inhalant mists, or (3) orally active formulations are envisioned. The GDNFR protein
product pharmaceutical composition also may be formulated for parenteral
administration. Such parenterally administered therapeutic compositions are typically
in the form of a pyrogen-free, parenterally acceptable aqueous solution comprising the
GDNFR protein product in a pharmaceutically acceptable vehicle. One preferred
vehicle is physiological saline. The GDNFR protein product pharmaceutical
compositions also may include particulate preparations of polymeric compounds such
as polylactic acid, polyglycolic acid, etc., or liposomes. Hyaluronic acid may also
be used, and this may have the effect of promoting sustained duration in the
circulation.

A particularly suitable vehicle for parenteral injection is sterile distilled water in 
which the GDNFR protein product is formulated as a sterile, isotonic solution, 
properly preserved. Yet another preparation may involve the formulation of the
GDNFR protein product with an agent, such as injectable microspheres or liposomes,
that provides for the slow or sustained release of the protein which may then be 
delivered as a depot injection. Other suitable means for the introduction of GDNFR 
protein product include implantable drug delivery devices which contain the GDNFR 
protein product. 

The preparations of the present invention may include other components, for
example parenterally acceptable preservatives, tonicity agents, cosolvents, wetting
agents, complexing agents, buffering agents, antimicrobials, antioxidants and
surfactants, as are well known in the art. For example, suitable tonicity enhancing
agents include alkali metal halides (preferably sodium or potassium chloride),
mannitol, sorbitol and the like. Suitable preservatives include, but are not limited to,
benzalkonium chloride, thimerosal, phenethyl alcohol, methylparaben, propylparaben,
chlorhexidine, sorbic acid and the like. Hydrogen peroxide may also be used as a
preservative. Suitable cosolvents are for example glycerin, propylene glycol and
polyethylene glycol. Suitable complexing agents are for example caffeine,
polyvinylpyrrolidone, beta-cyclodextrin or hydroxypropyl-beta-cyclodextrin. Suitable
surfactants or wetting agents include sorbitan esters, polysorbates such as polysorbate
80, tromethamine, lecithin, cholesterol, tyloxapol and the like. The buffers can be
conventional buffers such as borate, citrate, phosphate, bicarbonate, or Tris-HCl.

The formulation components are present in concentrations that are acceptable to
the site of administration. For example, buffers are used to maintain the composition
at physiological pH or at slightly lower pH, typically within a pH range of from about
5 to about 8.

A pharmaceutical composition may be formulated for inhalation. For example,
the GDNFR protein product may be formulated as a dry powder for inhalation. 
GDNFR protein product inhalation solutions may also be formulated in a liquefied 
propellant for aerosol delivery. In yet another formulation, solutions may be
nebulized.

It is also contemplated that certain formulations containing GDNFR protein 
product are to be administered orally. GDNFR protein product which is administered 
in this fashion may be formulated with or without those carriers customarily used in 
the compounding of solid dosage forms such as tablets and capsules. For example, a
capsule may be designed to release the active portion of the formulation at the point in
the gastrointestinal tract when bioavailability is maximized and pre-systemic
degradation is minimized. Additional formulation materials may be included to
facilitate absorption of GDNFR protein product. Diluents, flavorings, low melting
point waxes, vegetable oils, lubricants, suspending agents, tablet disintegrating
agents, and binders may also be employed.

Another preparation may involve an effective quantity of GDNFR protein 
product in a mixture with non-toxic excipients which are suitable for the manufacture 
of tablets. By dissolving the tablets in sterile water, or other appropriate vehicle, 
solutions can be prepared in unit dose form. Suitable excipients include, but are not
limited to, inert diluents, such as calcium carbonate, sodium carbonate or bicarbonate,
lactose, or calcium phosphate; or binding agents, such as starch, gelatin, or acacia; or 
lubricating agents such as magnesium stearate, stearic acid, or talc. 

Additional GDNFR protein product formulations will be evident to those 
skilled in the art, including formulations involving GDNFR protein product in
combination with GDNF protein product. Techniques for formulating a variety of
other sustained- or controlled-delivery means, such as liposome carriers, bio-erodible
microparticles or porous beads and depot injections, are also known to those skilled in
the art. See, for example, Supersaxo et al. description of controlled release porous
polymeric microparticles for the delivery of pharmaceutical compositions (International
Publication No. WO 93/15722; International Application No. PCT/US93/00829) the
disclosure of which is hereby incorporated by reference.

D. Administration of GDNFR Protein Product 

The GDNFR protein product may be administered parenterally via a variety of 
routes, including subcutaneous, intramuscular, intravenous, transpulmonary,
transdermal, intrathecal and intracerebral delivery. In addition, protein factors that do
not readily cross the blood-brain barrier may be given directly intracerebrally or
otherwise in association with other elements that will transport them across the barrier.
For example, the GDNFR protein product may be administered 
intracerebroventricularly or into the brain or spinal cord subarachnoid space. GDNFR 
protein product may also be administered intracerebrally directly into the brain 

that has been modified chemically or packaged so that it passes the blood-brain barrier, 
or with one or more agents capable of promoting penetration of GDNFR protein 
product across the barrier. For example, a conjugate of NGF and monoclonal
anti-transferrin receptor antibodies has been shown to be transported to the brain via
binding to transferrin receptors.

To achieve the desired level of GDNFR protein product, repeated daily or less 
frequent injections may be administered, or GDNFR protein product may be infused
continuously or periodically from a constant- or programmable-flow implanted pump.
Slow-releasing implants containing the neurotrophic factor embedded in a
biodegradable polymer matrix can also deliver GDNFR protein product. The
frequency of dosing will depend on the pharmacokinetic parameters of the GDNFR
protein product as formulated, and the route and site of administration. 

Regardless of the manner of administration, the specific dose may be calculated 
according to body weight, body surface area or organ size. Further refinement of the
calculations necessary to determine the appropriate dosage for treatment involving each
of the above mentioned formulations is routinely made by those of ordinary skill in the
art and is within the ambit of tasks routinely performed by them. Appropriate dosages
may be ascertained through use of appropriate dose-response data. 

The final dosage regimen involved in a method for treating a specific injury or
condition will be determined by the attending physician. Generally, an effective
amount of the present GDNFR polypeptides will be determined by considering various
factors which modify the action of drugs, e.g. the age, condition, body weight, sex
and diet of the patient, the severity of any infection, time of administration and other
clinical factors. See, Remington's Pharmaceutical Sciences, supra, at pages 697-773,
herein incorporated by reference. It is contemplated that if GDNFR is used to enhance
GDNF action, then the GDNFR dose is selected to be similar to that required for
GDNF therapy; if GDNFR is used to antagonize GDNF action, then the GDNFR dose
would be many times the GDNF dose. Dosing may be one or more times
daily, or less frequently, and may be in conjunction with other compositions as
described herein. It should be noted that the present invention is not limited to the
dosages recited herein.

It is envisioned that the continuous administration or sustained delivery of
GDNFR protein products may be advantageous for a given treatment. While
continuous administration may be accomplished via a mechanical means, such as with
an infusion pump, it is contemplated that other modes of continuous or near
continuous administration may be practiced. For example, chemical derivatization or
encapsulation may result in sustained release forms of the protein which have the effect
of continuous presence in the bloodstream, in predictable amounts, based on a
determined dosage regimen. Thus, GDNFR protein products include proteins
derivatized or otherwise formulated to effectuate such continuous administration.
Sustained release forms of the GDNFR protein products will be formulated to provide
the desired daily or weekly effective dosage.

It is further contemplated that the GDNFR protein product may be administered 
in a combined form with GDNF. Alternatively, the GDNFR and GDNF protein 
products may be administered separately, either sequentially or simultaneously. 

GDNFR protein product of the present invention may also be employed, alone 
or in combination with other growth factors in the treatment of nerve disease. In
addition, other factors or other molecules, including chemical compositions, may be 
employed together with a GDNFR protein product. In the treatment of Parkinson's 
Disease, it is contemplated that GDNFR protein product be used by itself or in 
conjunction with the administration of Levodopa, wherein the GDNFR would enhance 
the activity of endogenous GDNF and thereby enhance the neuronal uptake of the
increased concentration of dopamine. 

As stated above, it is also contemplated that additional neurotrophic or neuron 
nurturing factors will be useful or necessary to treat some neuronal cell populations or 
some types of injury or disease. Other factors that may be used in conjunction with 
GDNFR or a combination of GDNFR and GDNF include, but are not limited to:
mitogens such as insulin, insulin-like growth factors, epidermal growth factor,
vasoactive growth factor, pituitary adenylate cyclase activating polypeptide, interferon
and somatostatin; neurotrophic factors such as nerve growth factor, brain derived
neurotrophic factor, neurotrophin-3, neurotrophin-4/5, neurotrophin-6, insulin-like
growth factor, ciliary neurotrophic factor, acidic and basic fibroblast growth factors,
fibroblast growth factor-5, transforming growth factor-β, cocaine-amphetamine
regulated transcript (CART); and other growth factors such as epidermal growth 
factor, leukemia inhibitory factor, interleukins, interferons, and colony stimulating 
factors; as well as molecules and materials which are the functional equivalents to these 
factors.

GDNFR protein product cell therapy, e.g., intracerebral implantation of cells
producing GDNFR protein product, is also contemplated. This embodiment would
involve implanting into patients cells capable of synthesizing and secreting a
biologically active form of GDNFR protein product. Such GDNFR protein
product-producing cells may be cells that are natural producers of GDNFR protein product or
may be recombinant cells whose ability to produce GDNFR protein product has been
augmented by transformation with a gene encoding the desired GDNFR protein
product. Such a modification may be accomplished by means of a vector suitable for
delivering the gene as well as promoting its expression and secretion. In order to
minimize a potential immunological reaction in patients being administered a GDNFR
protein product of a foreign species, it is preferred that the natural cells producing
GDNFR protein product be of human origin and produce human GDNFR protein
product. Likewise, it is preferred that the recombinant cells producing GDNFR
protein product be transformed with an expression vector containing a gene encoding a
human GDNFR protein product.

Implanted cells may be encapsulated to avoid infiltration of surrounding tissue. 
Human or non-human animal cells may be implanted in patients in biocompatible, 
semipermeable polymeric enclosures or membranes that allow release of GDNFR 
protein product, but that prevent destruction of the cells by the patient's immune
system or by other detrimental factors from the surrounding tissue. Alternatively, the
patient's own cells, transformed to produce GDNFR protein product ex vivo, could be 
implanted directly into the patient without such encapsulation. 

Techniques for the encapsulation of living cells are familiar to those of ordinary 
skill in the art, and the preparation of the encapsulated cells and their implantation in
patients may be accomplished without undue experimentation. For example, Baetge et
al. (International Publication No. WO 95/05452; International Application No.
PCT/US94/09299 the disclosure of which is hereby incorporated by reference)
describe biocompatible capsules containing genetically engineered cells for the
effective delivery of biologically active molecules. In addition, see U.S. Patent
Numbers 4,892,538, 5,011,472, and 5,106,627, each of which is specifically
incorporated herein by reference. A system for encapsulating living cells is described
in PCT Application WO 91/10425 of Aebischer et al., specifically incorporated herein
by reference. See also, PCT Application WO 91/10470 of Aebischer et al.; Winn et
al., Exper. Neurol., 113:322-329, 1991; Aebischer et al., Exper. Neurol., 111:269-275,
1991; Tresco et al., ASAIO, 38:17-23, 1992, each of which is specifically
incorporated herein by reference.

In vivo and in vitro gene therapy delivery of GDNFR protein product is also
envisioned. In vitro gene therapy may be accomplished by introducing the gene
coding for GDNFR protein product into targeted cells via local injection of a nucleic
acid construct or other appropriate delivery vectors (Hefti, J. Neurobiol., 25:1418-1435,
1994). For example, a nucleic acid sequence encoding a GDNFR protein
product may be contained in an adeno-associated virus vector for delivery into the
targeted cells (e.g., Johnson, International Publication No. WO 95/34670;
International Application No. PCT/US95/07178 the disclosure of which is hereby
incorporated by reference). Alternative viral vectors include, but are not limited to 
retrovirus, adenovirus, herpes simplex virus and papilloma virus vectors. Physical 
transfer, either in vivo or ex vivo as appropriate, may also be achieved by liposome- 
mediated transfer, direct injection (naked DNA), receptor-mediated transfer (ligand- 
DNA complex), electroporation, calcium phosphate precipitation or microparticle 
bombardment (gene gun). 

It is also contemplated that GDNFR protein product gene therapy or cell 
therapy can further include the delivery of GDNF protein product. For example, the
host cell may be modified to express and release both GDNFR protein product and 
GDNF protein product. Alternatively, the GDNFR and GDNF protein products may 
be expressed in and released from separate cells. Such cells may be separately 
introduced into the patient or the cells may be contained in a single implantable device, 
such as the encapsulating membrane described above.

It should be noted that the GDNFR protein product formulations described 
herein may be used for veterinary as well as human applications and that the term 
"patient" should not be construed in a limiting manner. In the case of veterinary 
applications, the dosage ranges may be determined as described above.

EXAMPLES



Example 1 

Identification of Cells Expressing High Affinity GDNF Binding Sites 

Expression cloning involved the selection of a source of mRNA which is likely 
to contain significant levels of the target transcript. Retina photoreceptor cells were 
identified as responsive to GDNF at very low concentrations, suggesting the existence 
of a functional, high affinity receptor. To confirm that rat photoreceptor cells did 
express a high affinity receptor for GDNF, [125I]GDNF binding and photographic
emulsion analysis were carried out.

Rat Retinal Cell Cultures

The neural retinas of 5-day-old C57Bl/6 mouse pups or 3-day-old Sprague-Dawley
rat pups (Jackson Laboratories, Bar Harbor, ME) were carefully removed and
dissected free of the pigment epithelium, cut into 1 mm2 fragments and placed into
ice-cold phosphate-buffered saline (PBS). The retinas were then transferred into 10 mL
of Hank's balanced salt solution (HBSS) containing 120 units papain and 2000 units
DNAase and incubated for 20 minutes at 37°C on a rotary platform shaker at about
200 rpm. The cells were then dispersed by trituration through fire-polished Pasteur
pipettes, sieved through a 20 µm Nitex nylon mesh and centrifuged for five minutes at
200 x g. The resulting cell pellet was resuspended into HBSS containing
1% ovalbumin and 500 units DNAase, layered on top of a 4% ovalbumin solution (in
HBSS) and centrifuged for 10 minutes at 500 x g. The final pellet was resuspended in
complete culture medium (see below), adjusted to about 15,000 cells/mL, and seeded
in 90 µL aliquots into tissue culture plates coated with polyornithine and laminin as
previously described (Louis et al., Journal Of Pharmacology And Experimental
Therapeutics, 262, 1274-1283, 1992).

The culture medium consisted of a 1:1 mixture of Dulbecco's Modified Eagle's
Medium (DMEM) and F12 medium, and was supplemented with 2.5%
heat-inactivated horse serum (Hyclone, Logan, UT), B27 medium supplement (GIBCO,
Grand Island, NY), D-glucose (final concentration: 5 mg/mL), L-glutamine (final
concentration: 2 mM), 20 mM HEPES, bovine insulin and human transferrin (final
concentrations: 2.5 and 0.1 mg/mL, respectively).



Immunocytochemical identification of photoreceptors

Photoreceptors were identified by immunostaining for arrestin, a rod-specific 
antigen. Cultures of photoreceptors were fixed for 30 minutes at room temperature 
with 4% paraformaldehyde in PBS, pH 7.4, followed by three washes in PBS. The
fixed cultures were then incubated in Superblock blocking buffer (Pierce, Rockford,
IL), containing 1% Nonidet P-40 to increase the penetration of the antibodies. The
anti-arrestin antibodies (polyclonal rabbit antibody against the synthetic peptide
sequence of arrestin: Val-Phe-Glu-Glu-Phe-Ala-Arg-Gln-Asn-Leu-Lys-Cys) were
then applied at a dilution of 1:2000 in the same buffer, and the cultures were
incubated for one hour at 37°C on a rotary shaker. After three washes with PBS, the
cultures were incubated for one hour at 37°C with goat-anti-rabbit IgG (Vectastain kit
from Vector Laboratories, Burlingame, CA) at a 1:500 dilution. After three washes
with PBS, the secondary antibodies were then labeled with an avidin-biotin-peroxidase
complex diluted at 1:500 (45 minutes at 37°C). After three more washes with PBS,
the labeled cell cultures were reacted for 5-20 minutes in a solution of 0.1 M Tris-HCl,
pH 7.4, containing 0.04% 3,3'-diaminobenzidine-(HCl)4, 0.06 percent NiCl2 and
0.02 percent hydrogen peroxide. Based on arrestin-immunoreactivity, about 90% of
the cells in the cultures were rod photoreceptors. 

The survival of photoreceptors was determined by examination of arrestin-
stained cultures with bright-light optics at 200X magnification. The number of
arrestin-positive photoreceptors was counted in one diametrical 1 x 6 mm strip
representing about 20 percent of the total surface area of a 6-mm well. Viable
photoreceptors were characterized as having a regularly-shaped cell body, with a
usually short axon-like process. Photoreceptors showing signs of degeneration, such
as having irregular, vacuolated perikarya or fragmented neurites, were excluded from
the counts (most of the degenerating photoreceptors, however, detached from the
culture substratum). Cell numbers were expressed as cells/6-mm well.
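
The sampling fraction quoted above can be checked with a short calculation. The following is a minimal sketch, assuming the well is a perfect circle of 6 mm diameter and treating the strip as a full 1 x 6 mm rectangle (which slightly overestimates its area, because the strip ends are clipped by the well wall). The function name and the example count are illustrative only.

```python
import math

def strip_fraction(well_diameter_mm, strip_width_mm):
    """Fraction of a circular well covered by one diametrical strip.

    Assumption: the strip is approximated as a rectangle whose length
    equals the well diameter.
    """
    well_area = math.pi * (well_diameter_mm / 2.0) ** 2
    strip_area = strip_width_mm * well_diameter_mm
    return strip_area / well_area

fraction = strip_fraction(6.0, 1.0)
print(f"strip covers about {fraction:.0%} of the well")

# Scaling a strip count to a whole-well estimate (hypothetical count of 500 cells).
cells_in_strip = 500
print(round(cells_in_strip / fraction), "cells per 6-mm well (estimate)")
```

Under these assumptions the strip covers roughly one fifth of the well, in line with the "about 20 percent" figure given in the text.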

Cultured rat retinal cells enriched for photoreceptors (10,000/6-mm well) were
treated with human recombinant GDNF (ten-fold serial dilutions ranging from
10 ng/mL to 1 pg/mL). The cultures were fixed after six days and immunostained for
arrestin, a rod photoreceptor-specific antigen. In cultures that were not treated with
GDNF, the number of photoreceptors declined steadily over time to reach about
25 percent of the initial number after six days in culture. Treatment of the cultures
with GDNF resulted in an about two-fold higher number of viable arrestin-positive
photoreceptors after six days in culture. The effect of GDNF was maximal at about
200 pg/mL, with an ED50 of about 30 pg/mL. In addition to promoting photoreceptor
survival, the addition of the GDNF also stimulated the extension of their axon-like
process, thereby demonstrating an effect on the morphological development of the
photoreceptors (mean neurite length of photoreceptors in GDNF: 68 µm, compared to
27 ± 18 µm in control cultures).
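
The dose series and the approximate survival figures described above can be laid out as follows. This is a sketch only: the helper name is hypothetical, and the survival numbers simply restate the "about 25 percent" and "about two-fold" values from the text rather than measured data.

```python
def tenfold_series(start_pg_per_ml, stop_pg_per_ml):
    """Ten-fold serial dilutions from start down to stop (inclusive), in pg/mL."""
    series = []
    conc = start_pg_per_ml
    while conc >= stop_pg_per_ml * 0.999:  # small tolerance for rounding
        series.append(conc)
        conc /= 10.0
    return series

# 10 ng/mL equals 10,000 pg/mL
doses = tenfold_series(10_000.0, 1.0)
print(doses)

initial_cells = 10_000                    # photoreceptors seeded per 6-mm well
control_day6 = 0.25 * initial_cells       # about 25 percent remain without GDNF
treated_day6 = 2 * control_day6           # about two-fold more with GDNF
print(control_day6, treated_day6)
```

The series spans five concentrations, which brackets both the reported ED50 (about 30 pg/mL) and the concentration giving the maximal effect (about 200 pg/mL).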

In order to confirm that rat retinal cells express high affinity GDNF receptors,
[125I]GDNF binding and photographic emulsion analysis were carried out. Post-natal
rat photoreceptor cells were seeded on plastic slide flaskettes (Nunc) at a density of
2800 cells/mm2, three to four days before the experiments. The cells were washed
once with ice-cold washing buffer (Dulbecco's Modified Eagle's Medium (DMEM)
containing 25 mM N-2-hydroxyethylpiperazine-N'-2-ethanesulfonic acid (HEPES),
pH 7.5). For competitive binding, the cells were incubated with various
concentrations of [125I]GDNF in binding buffer (DMEM containing 25 mM HEPES,
pH 7.5, and 2 mg/mL of bovine serum albumin (BSA)) in the presence or absence of
500 nM unlabeled GDNF at 4°C for four hours. Cells were washed four times with
ice-cold washing buffer, lysed in 1 M NaOH and the radioactivity associated with the
cells was determined in a gamma counter. A significant amount of [125I]GDNF
bound to the photoreceptor cells even at low ligand concentrations (as low as 30 pM),
and this binding was inhibited completely by the presence of excess unlabeled GDNF.

For photographic emulsion detection, cells were incubated with 50 pM of
[125I]GDNF in binding buffer in the presence or absence of 500 nM unlabeled GDNF
at 4°C for four hours. Cells were washed six times with ice-cold washing buffer,
fixed with 2.5% glutaraldehyde and dehydrated sequentially with 50% and 70%
ethanol, and dipped in NTB-2 photographic emulsion (Eastman Kodak, Rochester,
NY). After five days of exposure, the slides were developed and examined. The
photographic emulsion analysis demonstrated the association of [125I]GDNF with some
of the photoreceptor cells, thereby indicating the presence of a receptor for GDNF.
This association, however, was efficiently blocked by the addition of unlabeled
GDNF.



Example 2 

Expression Cloning of a GDNFR from Photoreceptor Cells

Rat photoreceptor cells were selected as a possible source of a high affinity
receptor for GDNF based upon their cell surface binding of radiolabeled GDNF and
their ability to respond to very low concentrations of the ligand, as described in
Example 1. In order to identify the receptor, a size-selected cDNA library of
approximately 50,000 independent clones was constructed using a mammalian
expression vector (a derivative of pSR, Takebe et al., 1988 supra) and mRNA isolated
from cultured post-natal rat photoreceptor cells, by the methods described below. The
library was divided into pools of approximately 1,500 to 2,000 independent clones
and screened using an established expression cloning approach (Gearing et al., EMBO
Journal, 8, 3667-3676, 1989). Plasmid DNA representing each pool of the library
was prepared and transfected into COS7 cells grown on plastic microscope slide
flaskettes (Nunc, Naperville, IL).

The transfected cells were treated with [125I]GDNF, fixed with
glutaraldehyde, dehydrated, and dipped in photographic emulsion for
autoradiography. Following exposure for five days, the slides were developed and
examined for the presence of cells covered by silver grains which indicated the binding
of [125I]GDNF to the cell surface as a result of the cell's expression of a receptor for
GDNF. EGF receptor transfected cells treated with [125I]EGF were used as a
positive control.

One of the 27 pools (F8-11) screened in this manner exhibited 19 positive cells
following transfection. Thus, a single cDNA library pool was identified which
contained a cDNA clone that expressed GDNFR. This pool was divided into 60
smaller subpools of 100 clones/pool which were rescreened by the same procedure
described above. Five of these pools were identified as positive and two of the five
pools were further subdivided to yield single clones responsible for the GDNF binding
activity. Transfection of plasmid DNA from the single clones into COS7 cells resulted
in the binding of [125I]GDNF to approximately 15% of the cells. This binding was
specifically inhibited by competition with excess unlabeled GDNF.

Construction of Expression cDNA Libraries

Rat retinal cells were harvested from postnatal day 3-7 rats and seeded into
culture dishes coated with laminin and polyornithine at a density of approximately
5700 cells/mm2. After 3-4 days in culture, the population was estimated to contain
approximately 80% photoreceptor cells. Total RNA was prepared from this culture by
standard methods, and Poly A+ RNA was purified using a polyA-tract kit (Promega,
Madison, WI). A cDNA library was constructed from the rat photoreceptor poly A+
RNA using the Gibco Superscript Choice System (Gibco/BRL, Gaithersburg, MD).
Two micrograms of poly A+ RNA were mixed with 50 ng of random hexamers,
heated to 70°C for 10 minutes and then quick-chilled on ice. First strand synthesis
was carried out with 400U Superscript II RT at 37°C for one hour. Second strand
synthesis was performed in the same tube after the addition of dNTPs, 10U of E. coli
DNA ligase, 40U of E. coli DNA polymerase I, and 2U of E. coli RNase H. After
two hours at 16°C, the cDNA ends were blunted by treatment with 10U of T4
polymerase for an additional five minutes at 16°C. Following isopropanol
precipitation, EcoRI cloning sites were added to the cDNA by ligation overnight with
10 pg of unphosphorylated EcoRI adapter oligonucleotides.

The EcoRI adapted cDNA was then phosphorylated and applied to a Sephacryl
S-500 HR size fractionation column. Following loading, the column was washed
with 100 µl aliquots of TEN buffer (10 mM Tris-HCl pH 7.5, 0.1 mM EDTA,
25 mM NaCl), and 30 µl fractions were collected. Fractions 6 through 8, which
contained approximately 34 ng of high molecular weight cDNA, were pooled and
precipitated. The recovered EcoRI-adapted cDNA was ligated overnight with 50 ng of
EcoRI cut vector pBJ5. Aliquots of the ligation mix containing about 15 ng cDNA
each were transformed into competent cells (E. coli strain DH10B; GIBCO/BRL,
Gaithersburg, MD) by electroporation. The transformation mixture was titered and
then plated on 27 Amp/LB plates at a density of 1500 colonies/plate. Colonies were
scraped from each plate and collected into 10 mL of Luria broth (LB) to make 27 pools
of 1500 independent clones each. A portion of the cells from each pool was frozen in
glycerol and the remainder was used to isolate plasmid DNA using a Qiagen tip-500 kit
(Qiagen Inc., Chatsworth, CA).

COS Cell Transfection and Photographic Emulsion Analysis

COS7 cells were seeded (220,000 cells/slide) on plastic slide flaskettes (Nunc)
coated with ProNectin (10 µg/mL in phosphate buffered saline (PBS)) one day before
transfection. For transfection, 700 µl of Opti-MEM I (GIBCO/BRL, Gaithersburg,
MD) containing 2 ng cDNA was mixed gently with 35 µl of DEAE Dextran solution
(10 mg/mL, Sigma, St. Louis, MO) in an Eppendorf tube. Cells were washed twice
with PBS and incubated with the transfection mix for 30 minutes at 37°C in a 5% CO2
atmosphere. Following incubation, 3 mL of DMEM media containing 10% fetal calf
serum (FCS) and 80 nM Chloroquine (Sigma, St. Louis, MO) were added to each
flaskette. Cells were further incubated for 3.5 hours, shocked with 10%
dimethylsulfoxide in DMEM at room temperature for two minutes, washed once with
PBS, and allowed to grow in DMEM containing 10% FCS. After 48 hours, the
transfected COS7 cells were washed once with ice-cold washing buffer (DMEM
containing 25 mM HEPES, pH 7.5) and incubated in ice-cold binding buffer (DMEM
containing 25 mM HEPES, pH 7.5 and 2 mg/mL BSA) supplemented with 50 pM
[125I]GDNF at 4°C for four hours. Cells were washed six times in ice-cold washing
buffer, fixed with 2.5% glutaraldehyde at room temperature for five minutes,
dehydrated sequentially with 50% and 70% ethanol, and then dipped in NTB-2
photographic emulsion (Eastman Kodak). After a 4-5 day exposure at 4°C in the dark, the
slides were developed and screened by bright-field and dark-field microscopy.

Subdivision of Positive Pools

A single pool was identified which contained a putative GDNF receptor clone.
Clones from this pool were plated on 60 plates at a density of 100 colonies/plate.
Cells were scraped from each plate, collected in LB, and allowed to grow for 4-5
hours at 37°C. Frozen stocks and DNA preparations were made from each pool, as
before, to generate 60 subpools containing 100 independent clones each. Two of
these 60 subpools were identified as positive by the method described above, and
clones from those pools were plated at low density to allow isolation of single
colonies. Single colonies (384) were picked from each of the two subpools and
grown for six hours in 200 µl LB in 96-well plates. In order to select single clones
expressing GDNFR, the four 96-well plates were arrayed into a single large matrix
consisting of 16 rows and 24 columns. Cells from the wells in each row and in each
column were combined to yield a total of 40 mixtures. These mixtures were grown
overnight in 10 mL LB/Amp (100 µg/mL), and DNA was prepared using a Qiagen
tip-20 kit. When analyzed for putative GDNF receptor clones, three row mixtures and
three column mixtures gave positive signals, suggesting nine potentially positive single
clones. DNA from each of the potentially positive single clones was prepared and
digested with EcoRI and PstI. DNA from three of the nine single clones exhibited
identical restriction patterns while the other six were unrelated, suggesting that the
three represented the authentic clones containing GDNFR.
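
The row and column pooling logic described above can be sketched as follows. The row and column indices used here are hypothetical placeholders (the text does not state which mixtures were positive); only the matrix dimensions and the counts of positive mixtures come from the description.

```python
from itertools import product

ROWS, COLS = 16, 24   # four 96-well plates arrayed as a single matrix

def candidate_wells(positive_rows, positive_cols):
    """Wells lying at the intersection of a positive row and a positive column."""
    return sorted(product(positive_rows, positive_cols))

# One mixture per row plus one mixture per column.
n_mixtures = ROWS + COLS
print(n_mixtures)

# Hypothetical indices for the three positive rows and three positive columns.
hits = candidate_wells([2, 7, 11], [4, 9, 20])
print(len(hits), "candidate wells:", hits)
```

Screening 40 mixtures in place of 384 individual wells narrows the search to nine intersections, which is why a second step (here, restriction digestion) is still needed to separate true positives from coincidental intersections.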

Example 3

DNA Sequencing and Sequence Analysis

DNA from positive, single clones was prepared and sequenced using an
automated ABI373A DNA sequencer (Perkin/Elmer Applied Biosystems, Santa Clara,
CA) and dideoxy-dye-terminators, according to manufacturer's instructions.
Comparison of GDNF receptor sequence with all available public databases was
performed using the FASTA (Pearson and Lipman, Proceedings Of The National
Academy Of Sciences U.S.A., 85, 2444-2448, 1988) program algorithm as described
in the University of Wisconsin Genetics Computer Group package (Program Manual
for the Wisconsin Package, Version 8, September 1994, Genetics Computer Group,
Madison, Wisconsin).

Sequence Characterization of the Rat GDNFR 

Plasmid DNA from the clones described in Example 2, above, was prepared
and submitted for DNA sequence analysis. Nucleotide sequence analysis of the cloned
2138 bp rat cDNA revealed a single large open reading frame encoding a translated
protein of 468 amino acid residues (Figure 3).

The coding sequence is flanked by a 5'-untranslated region of 301 bp and a
3'-untranslated region of 430 bp that does not contain a potential polyadenylation site.
The presence of an in-frame stop codon upstream of the first ATG at base pair 302 and
its surrounding nucleotide context indicate that this methionine codon is the most likely
translation initiator site (Kozak, Nucleic Acids Research, 15, 8125-8148, 1987).

No polyadenylation signal is found in the 430 nucleotides of 3' untranslated
sequence in the rat cDNA clone. This is not surprising, since the Northern blot data
show the shortest mRNA transcripts to be approximately 3.6 kb.
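
The lengths quoted above are mutually consistent, as the following arithmetic sketch shows. It assumes that a single three-nucleotide stop codon follows the open reading frame and is counted in neither untranslated region; the constant names are illustrative.

```python
UTR5_BP = 301     # 5'-untranslated region
CODONS = 468      # amino acid residues encoded by the open reading frame
UTR3_BP = 430     # 3'-untranslated region
STOP_BP = 3       # assumption: one stop codon, counted in neither UTR

orf_start = UTR5_BP + 1                       # position of the initiator ATG
coding_bp = CODONS * 3
total_bp = UTR5_BP + coding_bp + STOP_BP + UTR3_BP

print(orf_start)   # 302
print(total_bp)    # 2138
```

Under that assumption the initiator ATG falls at base pair 302 and the segments sum to the stated 2138 bp.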

The GDNFR polypeptide sequence has an N-terminal hydrophobic region of 
approximately 19 residues (methionine-1 to alanine-19, Figure 3) with the 
characteristics of a secretory signal peptide (von Heijne, Protein Sequences And Data 
Analysis. 1, 41-42, 1987; von Heijne, Nucleic Acids Research. 14, 4683-4690, 
1986). No internal hydrophobic domain that could serve as a transmembrane domain 
was found. Instead, a carboxy-terminal hydrophobic region of 21 residues (leucine- 
448 to serine-468 in Figure 3) is present and may be involved in a glycosyl- 
phosphatidylinositol (GPI) anchorage of the receptor to the cytoplasmic membrane. 
Except for the presence of three potential N-linked glycosylation sites, no conserved 
sequence or structural motifs were found. The protein is extremely rich in cysteine (31
of the 468 amino acid residues), but the spacing of the cysteines is not shared with that
of cysteine-rich domains found in the extracellular portions of known receptors.

The GDNFR sequence was compared to sequences in available public 
databases using FASTA. The search did not reveal significant homology to other 
published sequences. Once the rat cDNA clone was obtained, it was radiolabeled and 
used to probe a cDNA library prepared from human brain substantia nigra as described 
below in Example 6.

Example 4

GDNF Binding to Cells Expressing GDNFR 

A binding assay was performed in accordance with an assay method
previously described by Jing et al. (Journal Of Cell Biology, 110, 283-294, 1990).
The assay involved the binding of [125I]GDNF to rat photoreceptor cells, COS7 cells
or 293T cells which had been transfected to express GDNFR. Recombinant GDNFR
expressed on the surface of 293T cells was able to bind GDNF specifically and with
an affinity comparable to that observed for GDNF binding sites on rat retinal cells.

Rat photoreceptor cells were prepared as described in Example 1, above, and
seeded at a density of 5.7 x 10^5 cells/cm2 two to three days before the assay in 24-
well Costar tissue culture plates pre-coated with polyornithine and laminin. COS7
cells were seeded at a density of 2.5 x 10^4 cells/cm2 one day before the assay and
transfected with 10-20 µg of plasmid DNA using the DEAE-dextran-chloroquine
method (Aruffo and Seed, Proceedings Of The National Academy Of Sciences
U.S.A., 84, 8573-8577, 1987). Cells from each dish were removed and reseeded
into 30 wells of 24-well Costar tissue culture plates 24 hours following the
transfection, and allowed to grow for an additional 48 hours. Cells were then left on
ice for 5 to 10 minutes, washed once with ice-cold washing buffer and incubated with
0.2 mL of binding buffer containing various concentrations of [125I]GDNF with or
without unlabeled GDNF at 4°C for four hours. Cells were washed four times with
0.5 mL ice-cold washing buffer and lysed with 0.5 mL of 1 M NaOH. The lysates
were counted in a 1470 Wizard Automatic Gamma Counter.

For some binding experiments, transiently transfected 293T cells were used 
(see below for 293T cell transfection). Two days following transfection, cells were
removed from dishes by 2x versene. Cells were pelleted, washed once with ice-cold
binding buffer and resuspended in ice-cold binding buffer at a density of 3 x 10* 
cells/mL. The cell suspension was divided into aliquots containing 1.5 x 10^ 
cells/sample. Cells were then pelleted and incubated with various concentrations of
[125I]GDNF in the presence or absence of 500 nM of unlabeled GDNF at 4°C for
four hours with gentle agitation. Cells were washed four times with ice-cold washing
buffer and resuspended in 0.5 mL washing buffer. Two 0.2 mL aliquots of the
suspension were counted in a gamma counter to determine the amount of [125I]GDNF
associated with the cells.

In all assays, nonspecific binding was determined by using duplicate samples,
one of which contained 500 nM of unlabeled GDNF. The level of nonspecific binding
varied from 10% to 20% of the specific binding measured in the absence of unlabeled
GDNF and was subtracted from the specific binding. The assays demonstrated that
cells did not bind GDNF unless the cell had been transfected with the GDNFR cDNA
clone.
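
The background correction described above can be illustrated with a short calculation. This sketch assumes the conventional treatment of such duplicate samples: counts measured without competitor are taken as total binding, counts measured with 500 nM unlabeled GDNF are taken as nonspecific binding, and the difference is the specific binding. The counts shown are hypothetical.

```python
def specific_binding(total_cpm, nonspecific_cpm):
    """Specific binding as total counts minus counts in the presence of excess cold ligand."""
    return total_cpm - nonspecific_cpm

# Hypothetical duplicate-sample readings (counts per minute).
total_cpm = 12_000.0         # sample without unlabeled GDNF
nonspecific_cpm = 1_500.0    # sample with 500 nM unlabeled GDNF

specific_cpm = specific_binding(total_cpm, nonspecific_cpm)
ratio = nonspecific_cpm / specific_cpm

print(specific_cpm)
print(f"nonspecific is {ratio:.1%} of specific binding")
```

With these example numbers the nonspecific component is about 14% of the corrected signal, which falls within the 10% to 20% range reported for the assays.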

Example 5
Tissue Distribution of GDNFR mRNA 



The pattern of expression of GDNFR mRNA in embryonic mouse, adult
mouse, rat, and human tissues was examined by Northern blot analysis. The cloned 
rat GDNFR cDNA was labeled using the Random Primed DNA Labeling Kit 
(Boehringer Mannheim, Indianapolis, IN) according to the manufacturer's procedures. 
Rat, mouse, and human tissue RNA blots (purchased from Clontech, Palo Alto, CA) 
were hybridized with the probe and washed using the reagents of the ExpressHyb Kit 
(Clontech) according to the manufacturer's instructions. 

Tissue Northern blots prepared from adult rat, mouse, and human tissues
indicated that GDNFR mRNA is most highly expressed in liver, brain, and kidney.
High mRNA expression was also detected in lung, with lower or non-detectable
amounts in spleen, intestine, testis, and skeletal muscle. In blots made from RNA
isolated from mouse embryo, expression was undetectable at embryonic day 7,
became apparent at day E11, and was very high by day E17. GDNFR mRNA was
expressed in tissue isolated from several subregions of adult human brain at relatively
equal levels. Expression of GDNFR mRNA in human adult brain showed little
specificity for any particular region.

In most tissues, transcripts of two distinct sizes were present. In mouse and
human tissues, transcripts of 8.5 and 4.4 kb were found, while in rat the transcripts 
were 8.5 and 3.6 kb. The relative amounts of the larger and smaller transcripts varied 
with tissue type, the smaller transcript being predominant in liver and kidney and the 
larger being more abundant in brain. The binding of GDNF to 293T cells transfected 
with a GDNFR cDNA clone in the pBKRSV vector was examined by Scatchard 
analysis. Two classes of binding sites were detected, one with a binding affinity in the 
low picomolar range and another with an affinity of about 500 pM. 
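
For a single class of non-interacting sites, the Scatchard transformation plots bound/free against bound, giving a straight line with slope -1/Kd and x-intercept Bmax. The following is a minimal sketch of that transformation on synthetic data. The Kd of 500 pM echoes the lower-affinity class mentioned above, while the Bmax value, the ligand concentrations, and the function names are hypothetical. With two classes of sites, as reported here, the plot is curved and each limb would need separate treatment, which this sketch does not attempt.

```python
def bound_one_site(free_pm, bmax, kd_pm):
    """Equilibrium binding to one class of sites: B = Bmax * F / (Kd + F)."""
    return bmax * free_pm / (kd_pm + free_pm)

def scatchard_fit(free, bound):
    """Least-squares line through (B, B/F); returns (Kd, Bmax)."""
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope              # slope of a Scatchard plot is -1/Kd
    bmax = -intercept / slope      # x-intercept is Bmax
    return kd, bmax

# Synthetic, noise-free data (hypothetical free ligand concentrations in pM).
free = [10.0, 30.0, 100.0, 300.0, 1000.0, 3000.0]
bound = [bound_one_site(f, bmax=1000.0, kd_pm=500.0) for f in free]

kd, bmax = scatchard_fit(free, bound)
print(round(kd, 3), round(bmax, 3))
```

Because the synthetic data follow the one-site model exactly, the fit returns the input Kd and Bmax; real binding data would scatter around the line.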

Example 6
Recombinant Human GDNFR 



An adult human substantia nigra cDNA library (5'-stretch plus cDNA library,
Clontech, Palo Alto, CA) cloned in bacteriophage λgt10 was screened using the rat
GDNFR cDNA clone of Example 2 as a probe. The probe was labeled with
[32P]-dNTPs using a Random Primed DNA Labeling Kit (Boehringer Mannheim,
Indianapolis, IN) according to the manufacturer's instructions. Approximately 1.2 x
10 λgt10 phage from the human substantia nigra cDNA library were plated on 15 cm
agarose plates and replicated on duplicate nitrocellulose filters. The filters were then
screened by hybridization with the radiolabeled probe. The filters were prehybridized
in 200 mL of 6 x SSC, 1 x Denhardts, 0.5% SDS, 50 µg/mL salmon sperm DNA at
55o C for 3.5 hours. Following the addition of 2 x 10* cpm of the radiolabeled probe 
hybridization was continued for 18 hours. Filters were then washed twice for 30
minutes each in 0.5x SSC, 0.1% SDS at 55°C and exposed to X-ray film overnight
with an intensifying screen.

Five positive plaques were isolated whose cDNA inserts represented portions 
of the human GDNFR cDNA. In comparison to the nucleic acid sequence of rat 
GDNFR depicted in Figure 3 (bp 0 through 2140), the five human GDNFR clones
were found to contain the following sequences:

TABLE 3

Clone 21-A
Clone 21-B
Clone 29
Clone 2
Clone 9

1247 through 2330 (SEQ ID NO:21)
1270 through 2330 (SEQ ID NO:23)
-235 through 1692 (SEQ ID NO:9)
-237 through 1692 (SEQ ID NO:11)
805 through 2971 (SEQ ID NO:15)

An alignment and comparison of the sequences, as depicted in Figure 5, provided a
consensus sequence for human GDNFR. The translation product predicted by the
human cDNA sequence consists of 465 amino acids and is 93% identical to rat
GDNFR.

To generate a human cDNA encoding the full length GDNFR, portions of
clones 21B and 2 were spliced together at an internal BglII site and subcloned into the
mammalian expression vector pBKRSV (Stratagene, La Jolla, CA).

Recombinant human GDNFR expression vectors may be prepared for
expression in mammalian cells. As indicated above, expression may also be in non-
mammalian cells, such as bacterial cells. The nucleic acid sequences disclosed herein
may be placed into a commercially available mammalian vector (for example, CEP4,
Invitrogen) for expression in mammalian cells, including the commercially available
human embryonic kidney cell line, "293". For expression in bacterial cells, one would
typically eliminate that portion encoding the leader sequence (e.g., nucleic acids 1-590
of Figure 1). One may add an additional methionyl at the N-terminus for bacterial
expression. Additionally, one may substitute the native leader sequence with a
different leader sequence, or other sequence for cleavage for ease of expression.

Example 7

Soluble GDNFR Constructs

Soluble human GDNFR protein products were made. The following examples
provide four different forms, differing only at the carboxy terminus, indicated by
residue numbering as provided in Figure 2. Two are soluble forms truncated at
different points just upstream from the hydrophobic tail and downstream from the last
cysteine residue. The other two are the same truncations but with the addition of the
"FLAG" sequence, an octapeptide to which a commercial antibody is available
(Eastman Kodak). The FLAG sequence is H2N-DYKDDDDK-COOH.

Method

Lambda phage clone #21, containing nearly the entire coding region of human
GDNFR, was digested with EcoRI to excise the cDNA insert. This fragment was
purified and ligated into EcoRI cut pBKRSV vector (Stratagene, La Jolla, CA) to
produce the clone 21-B-3/pBKRSV. Primers 1 and 2 as shown below were used in a
PCR reaction with the human GDNFR clone 21-B-3/pBKRSV as template. PCR
conditions were 94°C, five minutes followed by 25 cycles of 94°C, one minute;
55°C, one minute; 72°C, two minutes and a final extension of five minutes at 72°C.
This produced a fragment consisting of nucleotides 1265-1868 of the human GDNFR
clone plus a termination codon and HindIII restriction site provided by primer 2. This
fragment was digested with restriction enzymes HindIII (contained in primer 2) and
BglII (position 1304 in human GDNFR), and the resulting 572 nucleotide fragment
was isolated by gel electrophoresis. This fragment contained the hGDNFR coding
region from isoleucine-255 to glycine-443. A similar strategy was used with primers
1 and 3 to produce a fragment with BglII and HindIII ends which coded for
isoleucine-255 to proline-446. Primers 4 and 5 were designed to produce fragments
coding for the same regions of hGDNFR as primers 2 and 3, but with the addition of
the Flag peptide coding sequence (IBI/Kodak, New Haven, CN). The Flag peptide
sequence consists of eight amino acids (H2N-Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys-
COOH) to which antibodies are commercially available. Primers 1 and 4 or 1 and 5
were used in PCR reactions with the same template as before, and digested with
HindIII and BglII as before. This procedure produced fragments coding for
isoleucine-255 to glycine-443 and isoleucine-255 to proline-446, but with the addition
of the Flag peptide at their carboxy termini.
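
The thermal cycling program stated above can be written down as a small data structure, which also makes the total programmed hold time easy to compute. This is a sketch only: the names are illustrative, and the total ignores ramp times between temperatures, which depend on the instrument.

```python
INITIAL_DENATURATION_MIN = 5        # 94 C, five minutes
CYCLES = 25
CYCLE_STEPS_MIN = {                 # one cycle of the program
    "denature_94C": 1,
    "anneal_55C": 1,
    "extend_72C": 2,
}
FINAL_EXTENSION_MIN = 5             # 72 C, five minutes

def programmed_hold_time_min():
    """Sum of programmed hold times, excluding ramping between steps."""
    per_cycle = sum(CYCLE_STEPS_MIN.values())
    return INITIAL_DENATURATION_MIN + CYCLES * per_cycle + FINAL_EXTENSION_MIN

print(programmed_hold_time_min(), "minutes")
```

The programmed holds add up to 110 minutes; the actual run time on a given thermal cycler would be somewhat longer.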

Primers

1) 5'-CTGTTTGAATTTGCAGGACTC-3' (SEQ ID NO:30)

2) 5'-CTCCTCTCTAAGCTTCTAACCACAGCTTGGAGGAGC-3' (SEQ ID NO:31)

3) 5'-CTCCTCTCTAAGCTTCTATGGGCTCAGACCACAGCTT-3' (SEQ ID NO:32)

4) 5'-CTCCTCTCTAAGCTTCTACTTGTCATCGTCGTCCTTGTAGTCACCACAGCTTGGAGGAGC-3' (SEQ ID NO:33)

5) 5'-CTCCTCTCTAAGCTTCTACTTGTCATCGTCGTCCTTGTAGTCTGGCTCAGACCACAGCTT-3' (SEQ ID NO:34)
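
Because primers 2 through 5 are antisense primers, their design is easiest to read after reverse-complementing them. The sketch below does this for primer 4 and translates the sense strand up to the stop codon, using the standard genetic code. The helper names are illustrative; the primer sequence is the one listed above.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built in TCAG order for each codon position.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def reverse_complement(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def translate_to_stop(seq):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

PRIMER_4 = "CTCCTCTCTAAGCTTCTACTTGTCATCGTCGTCCTTGTAGTCACCACAGCTTGGAGGAGC"

sense = reverse_complement(PRIMER_4)
encoded = translate_to_stop(sense)
print(encoded)  # prints APPSCGDYKDDDDK
```

Read this way, the primer encodes a short stretch of receptor sequence ending in glycine, followed in frame by the Flag octapeptide DYKDDDDK, a stop codon, and the HindIII site (AAGCTT), which matches the description of the gly-443/Flag construct.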

All four fragments, produced as described above, were transferred back into
21B3/pBKRSV. The 21B3/pBKRSV clone was digested with BglII and HindIII, and
treated with calf intestinal alkaline phosphatase (CIAP). The large fragment containing
the vector and the human GDNFR coding region up to the BglII site was gel purified
and extracted from gel. Each of the four BglII/HindIII fragments produced as
described above was ligated into this vector resulting in the following constructs in
the pBKRSV vector:



TABLE 4

1) GDNFR/gly-443/pBKRSV        hGDNFR terminating at glycine 443, followed by stop codon
2) GDNFR/pro-446/pBKRSV        hGDNFR terminating at proline 446, followed by stop codon
3) GDNFR/gly-443/Flag/pBKRSV   hGDNFR terminating at glycine 443 with C-term Flag tag, followed by stop codon
4) GDNFR/pro-446/Flag/pBKRSV   hGDNFR terminating at proline 446 with C-term Flag tag, followed by stop codon



Correct construction of all clones was confirmed by DNA sequencing. Inserts
from the pBKRSV clones were transferred to other expression vectors using enzyme
sites present in the pBKRSV polylinker sequence as described below. Soluble
GDNFRs (e.g., sGDNFR/gly and sGDNFR/pro) have also been transferred into
vectors for transient expression and into pDSR-2 for stable expression in CHO cells.

pDSRα2+PL clones:

The appropriate pBKRSV clone is digested with XbaI and SalI. The insert is
ligated to pDSRα2+PL cut with the same enzymes and treated with CIAP. This
construction may be used for stable expression of GDNFR in CHO cells.

pCEP4 clones:

The appropriate pBKRSV clone is digested with SpeI and XhoI. The insert is
ligated to pCEP4 (Invitrogen, San Diego, CA) digested with NheI (compatible with
SpeI ends) and XhoI, and treated with CIAP. This construction may be used for
transient expression of GDNFR.

The plasmid construct pDSR-2 is prepared substantially in accordance with the
process described in the co-owned and copending U.S. Patent Application Serial
Number 501,904 filed March 29, 1990 (also see European Patent Application No.
90305433, Publication No. EP 398 753, filed May 18, 1990 and WO 90/14363
(1990)), the disclosures of which are hereby incorporated by reference. It will be
appreciated by those skilled in the art that a variety of nucleic acid sequences encoding
GDNFR analogs may be used.

Another construct is pDSRα2, a derivative of the plasmid pCD (Okayama &
Berg, Mol. Cell Biol. 3: 280-289, 1983) with three main modifications: (i) the SV40
polyadenylation signal has been replaced with the signal from the α-subunit of bovine
follicular stimulating hormone, α-bFSH (Goodwin et al., Nucleic Acids Res. 11:
6873-6882, 1983); (ii) a mouse dihydrofolate reductase minigene (Gasser et al., Proc.
Natl. Acad. Sci. 79: 6522-6526, 1982) has been inserted downstream from the
expression cassette to allow selection and amplification of the transformants; and (iii) a
267 bp fragment containing the "R-element" and part of the "U5" sequences of the
long terminal repeat (LTR) of human T-cell leukemia virus type I (HTLV-I) has been
cloned and inserted between the SV40 promoter and the splice signals as described
previously (Takebe et al., Mol. Cell Biol. 8: 466-472, 1988).

The expression of GDNFR in CHO cells has been verified by the binding of
iodinated GDNF to the cell surface. As discussed above, the recombinantly expressed
soluble GDNFR protein product may be used to potentiate the activity or cell
specificity of GDNF. Soluble GDNFR attached to a detectable label also may be used
in diagnostic applications as discussed above.




Example 8 

Chemical Crosslinking of GDNF with GDNFR 



In order to study its binding properties and molecular characteristics, GDNFR
was transiently expressed on the surface of 293T cells by transfection of the rat cDNA
clone. Transfection of 293T cells was performed using the Calcium Phosphate
Transfection System (GIBCO/BRL, Gaithersburg, MD) according to the
manufacturer's instructions. Two days following transfection, cells were removed by
2x versene treatment, washed once with washing buffer, and resuspended in washing
35 buffer at a density of 2 x 10*> cells/mL. A duplicate set of cells were incubated with 
0.5 U/mL PI-PLC at 37°C for 30 minutes before [125I]GDNF binding. These cells
were washed three times with ice-cold binding buffer and then incubated with 1 to 3
nM of [125I]GDNF along with other cells at 4°C for four hours. Cells were washed
four times with ice-cold washing buffer, resuspended in washing buffer supplemented
with 1 mM of bis(sulfosuccinimidyl) suberate for crosslinking (BS3, Pierce,
Rockford, IL) and incubated at room temperature for 30 minutes. Following three
washes with TBS, a duplicate group of samples was treated with 0.5 U/mL of PI-PLC
at 37°C for 30 minutes. These cells were pelleted and the supernatants were collected.
The cells were then washed with washing buffer and lysed along with all other cells
with 2x SDS-PAGE sample buffer. The cell lysates and the collected supernatants
were resolved by 7.5% SDS-PAGE.

1 0 The cell suspension was divided into aliquots containing 1 .5 x 1 cell/sample. 

Cells were then pelleted and incubated with various concentrations of [125I]GDNF in
the presence or absence of 500 nM of unlabeled GDNF at 4°C for four hours with
gentle agitation. Cells were washed four times with ice-cold washing buffer and
resuspended in 0.5 mL washing buffer. Two 0.2 mL aliquots of the suspension were
counted in a gamma counter to determine the amount of [125I]GDNF associated with
the cells.

Although mock transfected 293T cells did not exhibit any GDNF binding
capacity, GDNFR transfected cells bound [125I]GDNF strongly even at picomolar
concentrations. This binding was almost completely inhibited by 500 nM of unlabeled
GDNF, indicating a specific binding of native GDNF to the expressed receptors.
GDNFR expressed by the 293T cells can be released from the cells by
treatment with phosphatidylinositol-specific phospholipase C (PI-PLC, Boehringer
Mannheim, Indianapolis, IN). The treatment of transfected cells with PI-PLC prior to
ligand binding almost entirely eliminated the GDNF binding capacity of the cell.
Additionally, treatment of the transfected cells after cross-linking released the majority
of the cross-linked products into the media. These results strongly suggest that
GDNFR is anchored to the cell membrane through a GPI linkage.

Cross-linking data further indicated that the molecular weight of GDNFR is
approximately 50-65 kD, suggesting that there is a low level of glycosylation.
Although the major cross-linked species has a molecular mass consistent with a
monomer of the receptor, a minor species with approximately the mass expected for a
dimer has been found.






Example 9 

GDNF Signaling is Mediated by a Complex of GDNFR 
and the Ret Receptor Protein Tyrosine Kinase

Introduction

Mice carrying targeted null mutations in the GDNF gene exhibit various defects
in tissues derived from neural crest cells, in the autonomic nervous system, and in
trigeminal and spinal cord motor neurons. The most severe defects are the absence of
kidneys and complete loss of enteric neurons in the digestive tract. The phenotype of
GDNF knockout mice is strikingly similar to that of the c-ret knockout animals
(Schuchardt et al. 1994), suggesting a possible linkage between the signal transduction
pathways of GDNF and c-ret.

The proto-oncogene c-ret was identified using probes derived from an
oncogene isolated in gene transfer experiments (Takahashi et al., Cell. 42, 581-588,
1985; Takahashi and Cooper, Mol. Cell. Biol., 7, 1378-1385, 1987). Sequence
analysis of the c-ret cDNA revealed a large open reading frame encoding a novel
receptor protein tyrosine kinase (PTK). The family of receptor PTKs has been
grouped into sub-families according to extracellular domain structure and sequence
homology within the intracellular kinase domain (van der Geer et al., 1994). The
unique extracellular domain structure of Ret places it outside any other known receptor
PTK sub-family; it includes a signal peptide, a cadherin-like motif, and a cysteine-rich
region (van Heyningen, Nature, 367, 319-320, 1994; Iwamoto et al., 1993). In situ
hybridization and immunohistochemical analysis showed high level expression of ret
mRNA and protein in the developing central and peripheral nervous systems and in the
excretory system of the mouse embryo (Pachnis et al., 1993; Tsuzuki et al.,
Oncogene, 10, 191-198, 1995), suggesting a role of the Ret receptor either in the
development or in the function of these tissues. A functional ligand of the Ret receptor
has not been identified, thereby limiting a further understanding of the molecular
mechanism of Ret signaling. Mutations in the c-ret gene are associated with inherited
predisposition to cancer in familial medullary thyroid carcinoma (FMTC), and multiple
endocrine neoplasia type 2A (MEN2A) and 2B (MEN2B). These diseases are
probably caused by "gain of function" mutations that constitutively activate the Ret
kinase (Donis-Keller et al., Hum. Molec. Genet. 2, 851-856, 1993; Hofstra et al.,
Nature. 367, 375-376, 1994; Mulligan et al., Nature. 363, 458-460, 1993; Santoro et
al., Science. 267, 381-383, 1995). They confer a predisposition to malignancy
specifically in tissues derived from the neural crest, where ret is normally expressed in
early development. Another ret-associated genetic disorder, Hirschsprung's disease
(HSCR), is characterized by the congenital absence of parasympathetic innervation in
the lower intestinal tract (Edery et al., Nature. 367, 378-380, 1994; Romeo et al.,
1994). The most likely causes of HSCR are nonsense mutations that result in the
production of truncated Ret protein lacking a kinase domain or missense mutations that
inactivate the Ret kinase. As noted above, targeted disruption of the c-ret proto-oncogene
in mice results in renal agenesis or severe dysgenesis and lack of enteric
neurons throughout the digestive tract (Schuchardt et al., 1994). This phenotype
closely resembles that of GDNF knockout mice. Taken together, these data suggest
that both Ret and GDNF are involved in signal transduction pathways critical to the
development of the kidney and the enteric nervous system. How Ret and GDNF are
involved, however, was not known.

The isolation and characterization of cDNA for GDNFR by expression
cloning, as described above, led to the expression of GDNFR in the transformed
human embryonic kidney cell line 293T. Transformation resulted in the appearance of
both high (Kd of approximately 2 pM) and low (Kd of approximately 200 pM) affinity
binding sites. The high affinity binding sites could be composed of homodimers or
homo-oligomers of GDNFR alone, or of heterodimers or hetero-oligomers of GDNFR
with other molecules. As discussed above, because GDNFR lacks a cytoplasmic
domain, it must function through one or more accessory molecules in order to play a
role in GDNF signal transduction. In this study we confirm that, in the presence of
GDNFR, GDNF associates with the Ret protein tyrosine kinase receptor, and quickly
induces Ret autophosphorylation.



Results 

Neuro-2a Cells Expressing GDNFR Bind GDNF with High Affinity

Neuro-2a is a mouse neuroblastoma cell line that endogenously expresses a
high level of Ret protein (Ikeda et al., Oncogene. 5, 1291-1296, 1990; Iwamoto et al.,
Oncogene. 8, 1087-1091, 1993; Takahashi and Cooper, 1987) but does not express
detectable levels of GDNFR mRNA as judged by Northern blot. In order to determine
if Ret could associate with GDNF in the presence of GDNFR, a study was performed
to examine the binding of [125I]GDNF to Neuro-2a cells engineered to express
GDNFR. Neuro-2a cells were transfected with a mammalian expression vector
containing the rat GDNFR cDNA (such as the expression plasmid described above).
Three clonal lines, NGR-16, NGR-33, and NGR-38, were tested for their ability to
bind [125I]GDNF. The unbound [125I]GDNF was removed at the end of the
incubation and the amount of radioactivity associated with the cells was determined as
described in Experimental Procedures. All three lines were able to bind [125I]GDNF
specifically while parental Neuro-2a cells exhibited little or no [125I]GDNF binding
(Figure 6). Binding could be effectively competed by the addition of 500 nM
unlabeled GDNF. These results demonstrate that Ret receptor expressed on Neuro-2a
cells is unable to bind GDNF in the absence of GDNFR and are consistent with the
previous observation that GDNFR is not expressed at appreciable levels in Neuro-2a
cells.

Equilibrium binding of [125I]GDNF to NGR-38 cells was examined over a
wide range of ligand concentrations (0.5 pM to 1 nM of [125I]GDNF in the presence
or absence of 500 nM of unlabeled GDNF) (see Figure 7A). Following incubation,
unbound [125I]GDNF was removed and the radioactivity associated with the cells was
determined as described in Experimental Procedures. Results are depicted in Figure 7:
(A) Equilibrium binding of [125I]GDNF to NGR-38 cells (circles) and Neuro-2a cells
(squares) in the presence (open circles and open squares) or absence (filled circles and
filled squares) of unlabeled GDNF; (B) Scatchard analysis of [125I]GDNF binding to
NGR-38 cells. Neuro-2a cells exhibited little binding even at a concentration of 1 nM
[125I]GDNF, and this binding was not affected by the addition of excess unlabeled
GDNF. Binding to NGR-38 cells was analyzed by Scatchard plot as shown in
Figure 7B. Two classes of binding sites were detected, one with Kd = 1.5 ± 0.5 pM
and the other with Kd = 332 ± 53 pM. These dissociation constants are very similar to
the values obtained for the high and low affinity binding sites in 293T cells transiently
expressing GDNFR, as described above.
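
As an illustration only, the two classes of binding sites reported above can be sketched as two independent saturable sites, which gives the curved Scatchard plot that is typical when high and low affinity sites coexist. The Kd values below are those stated in the text; the Bmax values are hypothetical placeholders, since site numbers are not given here, and the function name is likewise an assumption made for this sketch.

```python
def two_site_bound(free_pm, bmax_high, kd_high_pm, bmax_low, kd_low_pm):
    """Bound ligand for two independent classes of sites (units of Bmax)."""
    high = bmax_high * free_pm / (kd_high_pm + free_pm)
    low = bmax_low * free_pm / (kd_low_pm + free_pm)
    return high + low


# Dissociation constants reported for NGR-38 cells (pM).
KD_HIGH_PM = 1.5
KD_LOW_PM = 332.0
# Hypothetical site numbers in arbitrary units; not taken from the text.
BMAX_HIGH = 1.0
BMAX_LOW = 10.0

# Free ligand concentrations (pM) within the assayed 0.5 pM to 1 nM range.
free_concentrations_pm = [0.5, 1.5, 50.0, 332.0, 1000.0]

# Each Scatchard point is (bound, bound/free).
scatchard_points = []
for free in free_concentrations_pm:
    bound = two_site_bound(free, BMAX_HIGH, KD_HIGH_PM, BMAX_LOW, KD_LOW_PM)
    scatchard_points.append((bound, bound / free))

for bound, ratio in scatchard_points:
    print(f"bound = {bound:.3f}, bound/free = {ratio:.4f}")
```

In such a model the bound/free ratio falls steeply while the high affinity sites fill and more gradually afterwards, so a plot of these points is concave rather than a single straight line.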

GDNF Associates with Ret in Neuro-2a Cells Expressing GDNFR 

In order to determine if the Ret receptor PTK could associate with GDNF in
cells expressing GDNFR, a cross-linking experiment was carried out using NGR-38
and parental Neuro-2a cells. NGR-38 cells were incubated with [125I]GDNF, treated
with cross-linking reagent, then lysed either directly in SDS-PAGE sample buffer or in
Triton X-100 lysis buffer and further immunoprecipitated with anti-Ret antibody as
described in the Experimental Procedures. The immunoprecipitates were analyzed by
SDS-PAGE in the absence (NR) or presence (R) of β-mercaptoethanol. Lysates were
treated with Ret specific antibody, immunoprecipitated, and analyzed by SDS-PAGE
under reducing conditions (see Figure 8, bands are marked as follows: ~75 kD, solid
triangle; ~150 kD, open triangle; ~185 kD, solid arrow; ~250 kD, asterisk; ~400 kD,
open arrow). The most prominent cross-linked species were at ~75 kD and ~185 kD,
with less intense bands of ~150 kD and ~250 kD. A very faint band of ~400 kD was
also visible (Figure 8, lane 2). When immunoprecipitates were analyzed by
non-reducing SDS-PAGE, the ~75 kD, ~150 kD and ~185 kD bands were present at about the
same intensity as in the reducing gel, but the amount of the ~400 kD band increased
dramatically (Figure 8, lane 4). Also becoming more prominent was the band at
~250 kD.

Under both reducing and non-reducing conditions, bands of similar molecular
weight but of greatly reduced intensity were observed when parental Neuro-2a cells
were used instead of NGR-38 (Figure 8, lanes 1 and 3). The ~75 kD and ~150 kD
species are likely to represent cross-linked complexes of GDNF and GDNFR, since
species with identical molecular weights are produced by cross-linking in 293T cells
that do not express Ret. Furthermore, since the molecular weight of Ret is 170 kD,
any complex including Ret must be of at least this size.

The fact that these complexes are immunoprecipitated by anti-Ret antibody
indicates they are products of an association between Ret and the GDNF/GDNFR
complex which was disrupted under the conditions of the gel analysis. It is envisioned
that the broad band at ~185 kD probably consists of one molecule of Ret (170 kD)
cross-linked with one molecule of monomeric recombinant GDNF (15 kD), although
some dimeric GDNF may be included. The presence of Ret in this species was
confirmed by a separate experiment in which a band of the same molecular weight was
observed when unlabeled GDNF was cross-linked to NGR-38 cells and the products
examined by Western blot with anti-Ret antibody (data not shown).

The ~400 kD band was not reliably identified, partly due to the difficulty in
estimating its molecular weight. The fact that it is prominent only under non-reducing
conditions indicates that it is a disulfide-linked dimer of one or more of the species
observed under reducing conditions. The most likely explanation is that it represents a
dimer of the 185 kD species, although it may be a mixture of high molecular weight
complexes consisting of two Ret, one or two GDNFR, and one or two GDNF
molecules. The exact identity of the ~250 kD band has not yet been determined. One
possibility is that it represents cross-linked heterodimers of the ~75 kD (GDNF
+ GDNFR) and ~185 kD (GDNF + Ret) complexes.
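
The band assignments proposed above rest on simple additive mass arithmetic, which can be sketched as follows. This is a rough illustration under stated assumptions: it uses only the nominal masses quoted in the text, ignores the mass of the cross-linker and any gel estimation error, and the helper name `pair_masses` is hypothetical.

```python
from itertools import combinations_with_replacement

# Nominal masses (kD) quoted in the text.
RET_KD = 170
GDNF_MONOMER_KD = 15
SPECIES_KD = {
    "GDNF+GDNFR": 75,   # ~75 kD cross-linked species
    "GDNF+Ret": 185,    # ~185 kD cross-linked species
}


def pair_masses(species):
    """Return the summed mass of every pair of species, self-pairs included."""
    return {
        f"{a} / {b}": species[a] + species[b]
        for a, b in combinations_with_replacement(sorted(species), 2)
    }


# One Ret plus one GDNF monomer accounts for the ~185 kD band.
ret_plus_gdnf = RET_KD + GDNF_MONOMER_KD

pairs = pair_masses(SPECIES_KD)
for label, mass in pairs.items():
    print(f"{label}: {mass} kD")
```

The pairwise sums come to 150, 260 and 370 kD, which lie near the observed ~150, ~250 and ~400 kD bands. That is compatible with the assignments suggested in the text but does not by itself establish them.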

GDNF Stimulates Autophosphorylation of Ret in Neuro-2a Cells Expressing GDNFR

The ability of the Ret protein tyrosine kinase receptor to associate with GDNF
in the presence of GDNFR led to the study of GDNF stimulation of the
autophosphorylation of Ret. NGR-38 cells were treated with GDNF, lysed, and the
lysates immunoprecipitated with anti-Ret antibody. The immunoprecipitates were
analyzed by Western blot using an anti-phosphotyrosine antibody as described in the
Experimental Procedures. When NGR-38 cells (Figure 9A, lanes 2-4) were treated
with purified recombinant GDNF produced in either mammalian (CHO cells;
Figure 9A, lane 4) or E. coli cells (Figure 9A, lanes 1, 3), a strong band was
observed at 170 kD, indicating autophosphorylation of tyrosine residues on the mature
form of Ret. A much weaker corresponding band was observed in GDNF-treated
Neuro-2a cells (Figure 9A, lane 1). No phosphorylation was observed on the
alternatively glycosylated 150 kD precursor form of Ret (Figure 9A). The induction of
Ret autophosphorylation by GDNF was dose dependent. The dose response and
kinetics of GDNF-induced Ret tyrosine phosphorylation in NGR-38 cells are shown
in panels B and C. In all panels, the tyrosine phosphorylated 170 kD Ret bands are
indicated by solid arrows. The amount of Ret protein loaded in each lane as
determined by reprobing of the immunoblot with anti-Ret antibody (Santa Cruz, C-19,
Cat. #sc-167) is shown on the right side of panel A. The band at ~150 kD represents
an alternatively glycosylated immature form of Ret that does not autophosphorylate. As
shown in Figure 9B, stimulation of Ret autophosphorylation in NGR-38 cells could be
detected with 50 pg/mL of GDNF and the response was saturated at 20-50 ng/mL
GDNF. The stimulation of Ret autophosphorylation by purified recombinant GDNF
in NGR-38 cells over times of 0-20 minutes following treatment is shown in Figure
9C. Increased levels of Ret autophosphorylation could be observed within one minute
of GDNF treatment and were maximal at 10 minutes following treatment (Figure 9C).

GDNF and Soluble GDNFR Induce Ret Autophosphorylation in Neuro-2a Cells

As discussed above, GDNFR is anchored to the cytoplasmic membrane
through a GPI linkage and can be released by treatment with
phosphatidylinositol-specific phospholipase C (PI-PLC). When NGR-38 cells were incubated with
PI-PLC, GDNF-induced receptor autophosphorylation of Ret in these cells was
abolished (Figure 10A; PI-PLC treated (lane 1) or untreated (lanes 2 and 3) NGR-38
cells were incubated with (lanes 1 and 3) or without (lane 2) GDNF and analyzed for
Ret autophosphorylation by immunoblotting as described in the Experimental
Procedures).

Figure 10B depicts parental Neuro-2a cells treated with (lanes 2, 4, 6, 8) or
without (lanes 1, 3, 5, 7) GDNF in the presence (lanes 5-8) or absence (lanes 1-4) of
PI-PLC/CM obtained from Neuro-2a or NGR-38 cells, as analyzed for Ret
autophosphorylation by immunoblotting as described in the Experimental Procedures.
NGR-38 cells treated with GDNF were used as a positive control. In both panels A
and B, the autophosphorylated 170 kD Ret bands are marked by solid arrows. When
conditioned medium containing soluble GDNFR released by PI-PLC treatment
(PI-PLC/CM) of NGR-38 cells was added to parental Neuro-2a cells along with
GDNF, autophosphorylation of the Ret receptor comparable to that obtained with
GDNF treatment of NGR-38 cells was observed (Figure 10B, lanes 2 and 8). Only
background levels of Ret autophosphorylation were observed when no GDNF was
added, or when conditioned media derived from PI-PLC treatment of Neuro-2a cells
was tested (Figure 10B, lanes 3-7).



Ret-Fc Fusion Protein Blocks Ret Phosphorylation Induced by GDNF and Soluble
GDNFR

To confirm that Ret phosphorylation induced by GDNF in the presence of
GDNFR is the result of receptor autophosphorylation, a study was performed to
determine whether a Ret extracellular domain/Immunoglobulin Fc (Ret-Fc) fusion
protein could block Ret activation. Because of the technical difficulty of blocking the
large number of GDNF alpha receptors expressed on NGR-38 cells, Ret
phosphorylation assays were performed using Neuro-2a as the target cell and culture
media removed from NGR-38 cells treated with PI-PLC as a source of GDNFR.
Cells were treated with mixtures including various combinations of GDNF
(50 ng/mL), media containing soluble GDNFR (e.g., PI-PLC/CM derived from
NGR-38 cells), and different concentrations of Ret-Fc fusion protein either alone or in
various combinations as indicated in Figure 11. Neuro-2a cells were treated with
GDNF, media containing soluble GDNFR, Ret-Fc, or the pre-incubated mixtures.
The cells were then lysed, and the lysates were analyzed for c-Ret autophosphorylation
by immunoprecipitation using anti-Ret antibody as described in the Experimental
Procedures. The immunoprecipitates were analyzed by Western blot using an
anti-phosphotyrosine antibody.

The pre-incubated mixture of GDNF and media containing soluble GDNFR
induced tyrosine phosphorylation of Ret receptors expressed in Neuro-2a at a level
comparable to GDNF-treated NGR-38 control cells (Figure 11, lanes 7 and 2). The
position of the autophosphorylated 170 kD Ret bands is marked by a solid arrow.
When Ret-Fc fusion protein was included in the pre-incubated GDNF/GDNFR
mixture, Ret phosphorylation was inhibited in a dose dependent manner (Figure 11,
lanes 8-10). This indicated that Ret phosphorylation is a result of a GDNF/Ret
interaction mediated by GDNFR. In untreated Neuro-2a cells or in cells treated with
any combination of GDNF or Ret-Fc fusion protein in the absence of GDNFR, only
background levels of Ret phosphorylation were observed (Figure 11, lanes 3-6).

GDNF Induces Autophosphorylation of c-RET Expressed in Embryonic
Neurons

Spinal cord motor neurons are one of the major targets of GDNF action in vivo 
(Henderson et al., Science. 266, 1062-1064, 1994; Li et al., Proceedings Of The
National Academy Of Sciences, U.S.A. 92, 9771-9775, 1995; Oppenheim et al.,
Nature. 373, 344-346, 1995; Yan et al., Nature. 373, 341-344, 1995; Zurn et al.,
Neuroreport. 6, 113-118, 1995). To test the ability of GDNF to induce Ret
autophosphorylation in these cells, embryonic rat spinal cord motor neurons were
treated with (lanes 2 and 4) or without (lanes 1 and 3) GDNF, followed by
lysis of the cells, immunoprecipitation with anti-Ret antibody, and analysis by Western
blotting with anti-phosphotyrosine antibody as described in the Experimental
Procedures. In lysates of cells treated with GDNF, a band of tyrosine phosphorylated
protein with a molecular mass of ~170 kD was observed (Figure 12, lane 2). No such
signal was observed with cells treated with binding buffer alone (Figure 12, lane 1).
When the same Western blot filter was stripped and re-probed with anti-Ret antibody
(i.e., the amount of c-Ret protein loaded in each lane was determined by reprobing the
immunoblot with the anti-Ret antibody), bands with the same molecular mass and
similar intensities appeared in both samples (Figure 12, lanes 3 and 4). The
phosphotyrosine band in GDNF-treated cells co-migrates with the Ret protein band,
indicating GDNF stimulated autophosphorylation of Ret. The autophosphorylated
Ret bands (lanes 1 and 2) and the corresponding protein bands (lanes 3 and 4) are
marked by a solid arrow.

Discussion

Polypeptide growth factors elicit biological effects through binding to their
cognate cell surface receptors. Receptors can be grouped into several classes based on
their structure and mechanism of action. These classifications include the protein
tyrosine kinases (PTKs), the serine/threonine kinases, and the cytokine receptors.
Receptor PTK signaling is initiated by a direct interaction with ligand, which induces
receptor dimerization or oligomerization that in turn leads to receptor
autophosphorylation. The activated receptor then recruits and phosphorylates
intracellular substrates, initiating a cascade of events which culminates in a biological
response (Schlessinger and Ullrich, Neuron 9, 383-391, 1992). In contrast, signal
transduction by serine/threonine kinase or cytokine receptors often involves formation
of multi-component receptor complexes in which the ligand binding and signal
transducing components are distinct. Examples are the TGF-β receptor complex, a
serine/threonine kinase receptor consisting of separate binding (Type II) and signaling
(Type I) components, and the CNTF family. CNTF, interleukin-6 (IL-6) and
leukemia inhibitory factor (LIF) share the common signaling components, gp130
and/or LIFR, in their respective receptor complexes. While the ligand specificity of
these complexes is determined by a binding subunit specific to each individual ligand,
signal transduction requires association of the initial complex of ligand and ligand
binding subunit with other receptor subunits which cannot bind ligand directly (Ip et
al., Cell. 69, 1121-1132, 1992). In the CNTF receptor complex, the ligand binding
component is CNTF receptor (CNTFR), which, like GDNFR, is a GPI-anchored
membrane protein. The present invention involves the description of the first example
of a receptor PTK whose autophosphorylation is dependent upon association with a
separate ligand-specific binding component.

The present study confirms that GDNFR, a GPI-linked membrane protein that
binds to GDNF with high affinity, is required for the efficient association of GDNF
with the Ret receptor PTK. In the absence of GDNFR, GDNF is unable to bind to Ret
or stimulate Ret receptor autophosphorylation. In the presence of GDNFR, GDNF
associates with Ret and rapidly induces Ret autophosphorylation in a dose-dependent
manner. GDNFR is able to function in either membrane bound or soluble forms
(Figure 11), as discussed above. GDNF concentrations of 50 pg/mL (1.7 pM) are
able to activate the Ret tyrosine kinase in cells expressing GDNFR. This is consistent
with the dissociation constant (1.5 pM) found for the high affinity GDNF binding sites
on NGR-38 cells. The rapid induction of Ret phosphorylation by GDNF (detectable
one minute after treatment) and the ability of Ret-Fc to block autophosphorylation
suggest that Ret is being activated directly rather than as a downstream consequence of
the phosphorylation of some other receptor.
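
The correspondence between 50 pg/mL and 1.7 pM can be checked with a short unit conversion. The sketch below assumes a molecular mass of 30 kD for GDNF, i.e. a dimer of the 15 kD monomer mentioned in the Results; that value and the function names are assumptions made for illustration. It also estimates receptor occupancy from the simple one-site relation L / (L + Kd), which ignores the low affinity sites and any ligand depletion.

```python
GDNF_DIMER_KD = 30.0        # assumption: two 15 kD monomers
KD_HIGH_AFFINITY_PM = 1.5   # high affinity Kd reported for NGR-38 cells


def pg_per_ml_to_pm(conc_pg_per_ml, mass_kd):
    """Convert pg/mL to pM: 1 pg/mL is 1e-9 g/L and 1 kD is 1e3 g/mol,
    so the molar concentration in pM is simply (pg/mL) / (kD)."""
    return conc_pg_per_ml / mass_kd


def fractional_occupancy(ligand_pm, kd_pm):
    """One-site equilibrium occupancy, L / (L + Kd)."""
    return ligand_pm / (ligand_pm + kd_pm)


# Lowest dose at which Ret autophosphorylation was detected (50 pg/mL).
threshold_pm = pg_per_ml_to_pm(50.0, GDNF_DIMER_KD)
occupancy = fractional_occupancy(threshold_pm, KD_HIGH_AFFINITY_PM)

# Saturating range reported for the response (20-50 ng/mL).
saturating_pm = [pg_per_ml_to_pm(ng * 1000.0, GDNF_DIMER_KD) for ng in (20.0, 50.0)]

print(f"50 pg/mL = {threshold_pm:.2f} pM, occupancy = {occupancy:.2f}")
print(f"20-50 ng/mL = {saturating_pm[0]:.0f}-{saturating_pm[1]:.0f} pM")
```

Under these assumptions the threshold dose works out to about 1.7 pM, at which roughly half of the high affinity sites would be occupied, while the saturating doses fall well above both reported dissociation constants.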

Cross-linking studies support the hypothesis that efficient association of Ret
with GDNF depends on GDNFR. Cross-linking of GDNF to Ret in NGR-38 cells,
which express high levels of GDNFR, is robust, but in parental Neuro-2a cells
cross-linked products are barely detectable. Although conclusive identification of all the
cross-linked complexes is difficult, the data clearly demonstrate an association of Ret
with GDNF that is dependent on the presence of GDNFR, and demonstrate that
GDNFR is included in some of the cross-linked products. The reason for the presence
of minor cross-linked species in Neuro-2a cells is not clear. While the expression of
GDNFR mRNA in Neuro-2a cells could not be detected by Northern blot, it is
possible that GDNFR is expressed at very low levels in these cells.

The fact that Ret can be activated by GDNF in cultured rat embryonic spinal
cord motor neurons further demonstrates the biological relevance of the Ret/GDNF
interaction. These cells are a primary target of GDNF in vivo, and have been shown
to respond to low doses of GDNF in vitro (Henderson et al., 1994). Stimulation of
Ret phosphorylation was abolished when the motor neuron cells were pre-treated with
PI-PLC (data not shown), suggesting that the activation of Ret by GDNF requires
GDNFR.

Although binding of ligand to the receptor extracellular domain is the first step
in the activation of other known receptor PTKs, the present data have shown that this is
not the case for GDNF and Ret. Figure 13 depicts a model for the binding of GDNF
to GDNFR and Ret, and the consequent activation of the Ret PTK in response to
GDNF. The initial event in this process is the binding of disulfide-linked dimeric
GDNF to GDNFR in either monomeric or dimeric form. Although there is currently
no direct evidence for the existence of dimeric GDNFR, when 293T cells were
transfected with GDNFR cDNA, two classes of binding sites appeared. The simplest
explanation for this observation is the existence of monomeric and dimeric GDNFR,
each with its own ligand binding affinity. This is consistent with the finding that
GDNF binding affinities are apparently unaffected by the presence of Ret. Since the
present experiments do not address the question of whether dimeric GDNFR is in
equilibrium with its monomer in the absence of GDNF or if dimerization is induced by
GDNF binding, these possibilities are presented as alternate pathways. The complex
consisting of dimeric GDNFR and dimeric GDNF can bind two molecules of Ret,
forming the active signaling complex. As for other PTKs, close contact between the
intracellular catalytic domains of two Ret molecules is likely to result in receptor
autophosphorylation. The notion that Ret functions by this mechanism is supported
by the fact that the MEN2A mutation, which causes steady state dimerization of Ret,
results in constitutive activation of the Ret kinase (Santoro et al., 1995).

Motor neurons have been reported to respond to GDNF with an ED50 of as
low as 5 fM (Henderson et al., 1994). Although it is difficult to compare binding
affinity with the ED50 for a biological response, it is possible that very high affinity
GDNF binding sites exist on these cells. Other cells, such as embryonic chick
sympathetic neurons, have been reported to bind GDNF with a Kd of 1-5 nM (Trupp
et al., Journal Of Cell Biology. 130, 137-148, 1995). It is unlikely that GDNFR is
involved in a receptor complex for such low affinity sites, but a weak direct interaction
between GDNF and Ret may be present.

Expression of c-ret has been observed during embryogenesis in many cell
lineages of the developing central and peripheral nervous systems, including cells of
the enteric nervous system (Pachnis et al., Development, 119, 1005-1017, 1993;
Tsuzuki et al., 1995). Outside the nervous system, c-ret expression has been detected
in the Wolffian duct, ureteric bud epithelium and collecting ducts of the kidney
(Pachnis et al., supra; Tsuzuki et al., 1995). Ret expression has also been detected in
all neuroblastoma cell lines derived from the neural crest (Ikeda et al., 1990) and from
surgically resected neuroblastomas (Nagao et al., 1990; Takahashi & Cooper, 1987).
GDNF expression has been observed in both CNS and PNS, as well as in
non-neuronal tissues during embryonic development. The levels of GDNF expression
found in many non-neuronal tissues were higher than in the nervous system
(Choi-Lundberg and Bohn, Brain Res. Dev. Brain Res. 85, 80-88, 1995). Although
expression of GDNFR has not been extensively studied, primary Northern blot
analysis detected the presence of high levels of the GDNFR mRNA in the liver, brain,
and kidney of adult rat and mouse. The similarity of the expression patterns of ret,
GDNF, and GDNFR in developing nervous system and kidney is consistent with their
combined action during development.

Mammalian kidney development has been postulated to result from reciprocal
interactions between the metanephron and the developing ureter, a branch developed
from the caudal part of the Wolffian duct (Saxen, Organogenesis of the kidney.
Development and Cell Biology series, Cambridge University Press, Cambridge,
England, 1987). While the expression of Ret has been found at the ureteric bud but
not in the surrounding mesenchyme in developing embryos, the expression of GDNF
was detected in the undifferentiated but not adult metanephric cap of the kidney. These
observations suggest that an interaction between GDNF and Ret is responsible for
initiating the development of the ureteric structure. Further support for this hypothesis
is provided by targeted disruptions of the GDNF and ret genes, which result in very
similar phenotypic defects in kidney (Schuchardt et al., Nature. 367, 380-383, 1994;
Sanchez, in press). Another major phenotypic defect observed in both GDNF (-/-)
and ret (-/-) knockout animals is a complete loss of the enteric neurons throughout the
digestive tract. Hirschsprung's disease, a genetic disorder characterized by the
congenital absence of parasympathetic innervation in the lower intestinal tract, has also
been linked to "loss-of-function" mutations in ret (Romeo et al., Nature. 367,
377-378, 1994; Edery et al., 1994). A later report (Angrist et al., Hum. Mol. Genet. 4,
821-830, 1995) indicated that, contrary to earlier observations, some Hirschsprung's
patients do not carry mutations in ret. It is now envisioned that such patients may
carry mutations in GDNF, GDNFR or some other critical component of this signaling
pathway.

Experimental Procedures

[125I]GDNF Binding to Neuro-2a Cells Expressing GDNFR

Neuro-2a cells (ATCC #CCL 131) were transfected with an expression
plasmid, as described above, using the Calcium Phosphate Transfection System
(GIBCO/BRL) according to the manufacturer's directions. Transfected cells were
selected for expression of the plasmid by growth in 400 µg/mL G418 antibiotic
(Sigma). G418 resistant clones were expanded and assayed for GDNFR expression
by binding to [125I]GDNF (Amersham, Inc., custom iodination, catalog #IMQ1057).
Cells from each clone were seeded at a density of 3 x 10^4 cells/cm2 in duplicate wells
of multi-well tissue culture plates (Becton Dickinson) pre-coated with polyornithine.
Cells were washed once with ice-cold washing buffer (DMEM containing 25 mM
HEPES, pH 7.5) and were then incubated with 50 pM [125I]GDNF in binding buffer
(washing buffer plus 0.2% BSA) at 4°C for four hours either in the presence or
absence of 500 nM unlabeled GDNF. Cells were then washed four times with
ice-cold washing buffer, lysed in 1 M NaOH, and the cell-associated radiolabel quantitated
in a 1470 Wizard Automated Gamma Counter (Wallac Inc.). The amount of GDNFR
expressed by individual clones was estimated by the ratio of [125I]GDNF bound to
cells in the absence and presence of unlabeled GDNF. Three clones were chosen as
representatives of high, moderate, and low level expressors of GDNFR for use in
binding experiments. The ratios of [125I]GDNF bound in the absence and presence of
unlabeled GDNF for these clones were: NGR-38, 16:1; NGR-16, 12.8:1; and
NGR-33, 8:1. Equilibrium binding of [125I]GDNF to NGR-38 cells was carried out
as described above except that concentrations of labeled GDNF ranged from 0.5 pM to
1 nM. In all assays, nonspecific binding as estimated by the amount of radiolabel
binding to cells in the presence of 500 nM unlabeled GDNF was subtracted from
binding in the absence of unlabeled GDNF. Binding data were analyzed by Scatchard
plot.
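
The two quantities used in this procedure, the ratio of binding in the absence and presence of unlabeled GDNF and the specific binding obtained by subtraction, can be sketched as follows. The counts are invented solely to illustrate the arithmetic and are not data from these experiments; the function name `summarize_clone` is hypothetical, and averaging the duplicate wells is an assumption about how duplicates were combined.

```python
from statistics import mean


def summarize_clone(total_cpm, nonspecific_cpm):
    """Average duplicate wells, then derive specific binding and the
    total-to-nonspecific ratio used to rank clones."""
    total = mean(total_cpm)              # binding without unlabeled GDNF
    nonspecific = mean(nonspecific_cpm)  # binding with 500 nM unlabeled GDNF
    return {
        "specific_cpm": total - nonspecific,
        "ratio": total / nonspecific,
    }


# Hypothetical duplicate-well counts (cpm), chosen to give a 16:1 ratio.
example = summarize_clone(total_cpm=[15800, 16200], nonspecific_cpm=[980, 1020])
print(f"specific binding = {example['specific_cpm']} cpm, "
      f"ratio = {example['ratio']:.1f}:1")
```

For a Scatchard plot, the specific binding at each ligand concentration would be converted to bound ligand and plotted as bound/free against bound.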
Chemical Cross-Linking

Neuro-2a or NGR-38 cells were washed once with phosphate-buffered saline
(PBS, pH 7.1), then treated for four hours at 4°C with 1 or 3 nM [125I]GDNF in
binding buffer in the presence or absence of 500 nM unlabeled GDNF. Following
binding, cells were washed four times with ice-cold washing buffer and incubated at
room temperature for 45 minutes with 1 mM bis suberate (BS3, Pierce) in washing
buffer. The cross-linking reaction was quenched by washing the cells three times with
Tris-buffered saline (TBS, pH 7.5). The cells were then either lysed directly in
SDS-PAGE sample buffer (80 mM Tris-HCl [pH 6.8], 10% glycerol, 1% SDS,
0.025% bromophenol blue) or in Triton X-100 lysis buffer (50 mM Hepes, pH 7.5,
1% Triton X-100, 50 mM NaCl, 50 mM NaF, 10 mM sodium pyrophosphate, 1%
aprotinin (Sigma, Cat.# A-6279), 1 mM PMSF (Sigma, Cat.# P-7626), 0.5 mM
Na3VO4 (Fisher Cat.# S454-50)). The lysates were clarified by centrifugation,
incubated with 5 ng/mL of anti-Ret antibody (Santa Cruz Antibody, C-19, Cat.
#SC-167), and the resulting immunocomplexes were collected by precipitation with
protein A-Sepharose CL-4B (Pharmacia). The immunoprecipitates were washed three
times with the lysis buffer, once with 0.5% NP-40 containing 50 mM NaCl and
20 mM Tris-Cl, pH 7.5, and were then resuspended in SDS-PAGE sample buffer.
Both the whole cell lysates and the immunoprecipitates were fractionated by 7.5%
SDS-PAGE with a ratio of Bis:Acrylamide at 1:200.

Western Blot Analysis

The autophosphorylation of Ret receptor was examined by Western blot
analysis. Briefly, cells were seeded 24 hours prior to the assay in 6-well tissue culture
dishes at a density of 1.5 x 10^6 cells/well. Cells were washed once with binding
buffer and treated with various concentrations of different reagents (including GDNF,
PI-PLC, PI-PLC/CM, and Ret-Fc fusion protein), either alone or in combination, in
binding buffer for various periods of time. Treated cells and untreated controls were
lysed in Triton X-100 lysis buffer and immunoprecipitated with the anti-Ret antibody
(Santa Cruz, C-19, Cat. #SC-167) and protein A-Sepharose as described above.
Immunoprecipitates were fractionated by SDS-PAGE and transferred to nitrocellulose
membranes as described by Harlow and Lane (Antibodies: A Laboratory Manual,
Cold Spring Harbor Laboratory: Cold Spring Harbor, New York, 1988). The
membranes were pre-blocked with 5% BSA (Sigma) and the level of tyrosine
phosphorylation of the receptor was determined by blotting the membrane with an
anti-phosphotyrosine monoclonal antibody 4G10 (UBI, Cat. #05-321) at room temperature
for two hours. The amount of protein included in each lane was determined by
stripping and re-probing the same membrane with the anti-Ret antibody. Finally, the
membrane was treated with chemiluminescence reagents (ECL, Amersham) following
the manufacturer's instructions and exposed to X-ray films (Hyperfilm-ECL,
Amersham).

Treatment of Cells with PI-PLC and Generation of PI-PLC Treated Conditioned Media

In order to release GPI-linked GDNFR from the cell surface, cells were
washed once with washing buffer, then incubated with 1 U/mL
phosphatidylinositol-specific phospholipase C (PI-PLC, Boehringer Mannheim, Cat. #1143069) in binding
buffer at 37°C for 45 minutes. The cells were then washed three times with washing
buffer and further processed for Ret autophosphorylation assay or cross-linking. For
generation of PI-PLC treated conditioned media (PI-PLC/CM), 8 x 10* cells were
removed from tissue culture dishes by treating the cells with PBS containing 2 mM
EDTA at 37°C for 5 to 10 minutes. Cells were washed once with washing buffer,
resuspended in 1 mL of binding buffer containing 1 U/mL of PI-PLC, and incubated
at 37°C for 45 minutes. The cells were pelleted, and the PI-PLC/CM was collected.

Preparation of the Ret-Fc Fusion Protein

A cDNA encompassing the entire coding region of c-Ret was isolated from a
day 17 rat placenta cDNA library using an oligonucleotide probe corresponding to the
first 20 amino acids of the mouse c-Ret (Iwamoto et al., 1993; van Heyningen, 1994).
The region coding for the extracellular domain of the Ret receptor (ending with the last
amino acid, R636) was fused in-frame with the DNA coding for the Fc region of
human IgG (IgG1) and subcloned into the expression vector pDSRα2 as previously
described (Bartley et al., Nature, 368, 558-560, 1994). The ret-Fc/pDSRα2 plasmid
was transfected into Chinese hamster ovary (CHO) cells and the recombinant Ret-Fc
fusion protein was purified by affinity chromatography using a Ni2+ column
(Qiagen).
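
The in-frame requirement for the Ret-Fc fusion can be illustrated with a minimal Python sketch. It only checks that the upstream fragment is a whole number of codons with no in-frame stop before the junction. The function name and the toy sequences are hypothetical placeholders, not the Ret or IgG1 sequences, and the residue-to-nucleotide arithmetic assumes numbering from the initiator methionine.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(upstream, downstream):
    """Join two coding fragments and check that the reading frame is preserved."""
    upstream = upstream.upper()
    downstream = downstream.upper()
    # The junction stays in frame only if the upstream part is whole codons.
    if len(upstream) % 3 != 0:
        raise ValueError("upstream fragment is not a whole number of codons")
    codons = [upstream[i:i + 3] for i in range(0, len(upstream), 3)]
    # A stop codon upstream of the junction would truncate the fusion protein.
    if any(codon in STOP_CODONS for codon in codons):
        raise ValueError("upstream fragment contains an in-frame stop codon")
    return upstream + downstream


# Length of a fragment encoding residues 1 through 636, assuming numbering
# starts at the initiator methionine: 636 codons.
ecd_length_nt = 636 * 3

# Toy fragments (placeholders only).
toy_ecd = "ATGAAACGT"  # Met-Lys-Arg
toy_fc = "GAGCCCAAA"   # Glu-Pro-Lys
fusion = fuse_in_frame(toy_ecd, toy_fc)
print(fusion, ecd_length_nt)
```

A fragment whose length is not a multiple of three, such as "ATGAA", raises a ValueError instead of silently producing a frameshifted fusion.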

Preparation of Embryonic Rat Spinal Cord Motor Neuron Cultures

Enriched embryonic rat spinal cord motor neuron cultures were prepared from
entire spinal cords of E15 Sprague-Dawley rat fetuses 24 hours before the
experiments. The spinal cords were dissected, and the meninges and dorsal root
ganglia (DRGs) were removed. The spinal cords were cut into smaller fragments and
digested with papain in L15 medium (Papain Kit, Worthington). The motor neurons,
which are larger than other types of cells included in the dissociated cell suspension,
were enriched using a 6.8% metrizamide gradient (Camu and Henderson,
J. Neuroscience, 44, 59-70, 1992). Enriched motor neurons residing at the interface
between the metrizamide cushion and the cell suspension were collected, washed, and
seeded in tissue culture dishes pre-coated with poly-L-ornithine and laminin at a
density of ~9 x 10* cells/cm2 and were cultured at 37°C.



Various references are cited herein, the disclosures of which are incorporated
by reference in their entireties.

While the present invention has been described in terms of preferred
embodiments and exemplary nucleic acid and amino acid sequences, it is understood
that variations and modifications will occur to those skilled in the art. Therefore, it is
intended that the appended claims cover all such equivalent variations which come
within the scope of the invention as claimed.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Fox, Gary M
Jing, Shuqian
Wen, Duanzhi

(ii) TITLE OF INVENTION: GLIAL CELL LINE-DERIVED NEUROTROPHIC
FACTOR RECEPTOR

(iii) NUMBER OF SEQUENCES: 34 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: AMGEN INC 

(B) STREET: 1840 DeHavilland Drive

(C) CITY: Thousand Oaks 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP: 91320-1789 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: US unknown

(B) FILING DATE: 14-APR-1997

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/017,221 

(B) FILING DATE: 09-MAY-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/015,907 

(B) FILING DATE: 22-APR-1996

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Curry, Daniel R. 

(B) REGISTRATION NUMBER: 32,727 

(C) REFERENCE/DOCKET NUMBER: A-401A

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 540..1934

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

AATCTGGCCT CGGAACACGC CATTCTCCGC GCCGCTTCCA ATAACCACTA ACATCCCTAA 60

CGAGCATCCG AGCCGAGGGC TCTGCTCGGA AATCGTCCTG GCCCAACTCG GCCCTTCGAG 120

CTCTCGAAGA TTACCGCATC TATTTTTTTT TTCTTTTTTT TCTTTTCCTA GCGCAGATAA 180

AGTGAGCCCG GAAAGGGAAG GAGGGGGCGG GGACACCATT GCCCTGAAAG AATAAATAAG 240

TAAATAAACA AACTGGCTCC TCGCCGCAGC TGGACGCGGT CGGTTGAGTC CAGGTTGGGT 300

CGGACCTGAA CCCCTAAAAG CGGAACCGCC TCCCGCCCTC GCCATCCCGG AGCTGAGTCG 360

CCGGCGGCGG TGGCTGCTGC CAGACCCGGA GTTTCCTCTT TCACTGGATG GAGCTGAACT 420

TTGGGCGGCC AGAGCAGCAC AGCTGTCCGG GGATCGCTGC ACGCTGAGCT CCCTCGGCAA 480

GACCCAGCGG CGGCTCGGGA TTTTTTTGGG GGGGCGGGGA CCAGCCCCGC GCCGGCACC 539

ATG TTC CTG GCG ACC CTG TAC TTC GCG CTG CCG CTC TTG GAC TTG CTC 587
Met Phe Leu Ala Thr Leu Tyr Phe Ala Leu Pro Leu Leu Asp Leu Leu
1 5 10 15

CTG TCG GCC GAA GTG AGC GGC GGA GAC CGC CTG GAT TGC GTG AAA GCC 635
Leu Ser Ala Glu Val Ser Gly Gly Asp Arg Leu Asp Cys Val Lys Ala
20 25 30

AGT GAT CAG TGC CTG AAG GAG CAG AGC TGC AGC ACC AAG TAC CGC ACG 683
Ser Asp Gln Cys Leu Lys Glu Gln Ser Cys Ser Thr Lys Tyr Arg Thr
35 40 45

CTA AGG CAG TGC GTG GCG GGC AAG GAG ACC AAC TTC AGC CTG GCA TCC 731
Leu Arg Gln Cys Val Ala Gly Lys Glu Thr Asn Phe Ser Leu Ala Ser
50 55 60

GGC CTG GAG GCC AAG GAT GAG TGC CGC AGC GCC ATG GAG GCC CTG AAG 779
Gly Leu Glu Ala Lys Asp Glu Cys Arg Ser Ala Met Glu Ala Leu Lys
65 70 75 80

CAG AAG TCG CTC TAC AAC TGC CGC TGC AAG CGG GGT ATG AAG AAG GAG 827
Gln Lys Ser Leu Tyr Asn Cys Arg Cys Lys Arg Gly Met Lys Lys Glu
85 90 95

AAG AAC TGC CTG CGC ATT TAC TGG AGC ATG TAC CAG AGC CTG CAG GGA 875
Lys Asn Cys Leu Arg Ile Tyr Trp Ser Met Tyr Gln Ser Leu Gln Gly
100 105 110

AAT GAT CTG CTG GAG GAT TCC CCA TAT GAA CCA GTT AAC AGC AGA TTG 923
Asn Asp Leu Leu Glu Asp Ser Pro Tyr Glu Pro Val Asn Ser Arg Leu
115 120 125

TCA GAT ATA TTC CGG GTG GTC CCA TTC ATA TCA GAT GTT TTT CAG CAA 971
Ser Asp Ile Phe Arg Val Val Pro Phe Ile Ser Asp Val Phe Gln Gln
130 135 140

GTG GAG CAC ATT CCC AAA GGG AAC AAC TGC CTG GAT GCA GCG AAG GCC 1019
Val Glu His Ile Pro Lys Gly Asn Asn Cys Leu Asp Ala Ala Lys Ala
145 150 155 160

TGC AAC CTC GAC GAC ATT TGC AAG AAG TAC AGG TCG GCG TAC ATC ACC 1067
Cys Asn Leu Asp Asp Ile Cys Lys Lys Tyr Arg Ser Ala Tyr Ile Thr
165 170 175

CCG TGC ACC ACC AGC GTG TCC AAC GAT GTC TGC AAC CGC CGC AAG TGC 1115
Pro Cys Thr Thr Ser Val Ser Asn Asp Val Cys Asn Arg Arg Lys Cys
180 185 190

CAC AAG GCC CTC CGG CAG TTC TTT GAC AAG GTC CCG GCC AAG CAC AGC 1163
His Lys Ala Leu Arg Gln Phe Phe Asp Lys Val Pro Ala Lys His Ser
195 200 205

TAC GGA ATG CTC TTC TGC TCC TGC CGG GAC ATC GCC TGC ACA GAG CGG 1211
Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp Ile Ala Cys Thr Glu Arg
210 215 220

AGG CGA CAG ACC ATC GTG CCT GTG TGC TCC TAT GAA GAG AGG GAG AAG 1259
Arg Arg Gln Thr Ile Val Pro Val Cys Ser Tyr Glu Glu Arg Glu Lys
225 230 235 240

CCC AAC TGT TTG AAT TTG CAG GAC TCC TGC AAG ACG AAT TAC ATC TGC 1307
Pro Asn Cys Leu Asn Leu Gln Asp Ser Cys Lys Thr Asn Tyr Ile Cys
245 250 255

AGA TCT CGC CTT GCG GAT TTT TTT ACC AAC TGC CAG CCA GAG TCA AGG 1355
Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu Ser Arg
260 265 270

TCT GTC AGC AGC TGT CTA AAG GAA AAC TAC GCT GAC TGC CTC CTC GCC 1403
Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu Leu Ala
275 280 285

TAC TCG GGG CTT ATT GGC ACA GTC ATG ACC CCC AAC TAC ATA GAC TCC 1451
Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro Asn Tyr Ile Asp Ser
290 295 300

AGT AGC CTC AGT GTG GCC CCA TGG TGT GAC TGC AGC AAC AGT GGG AAC 1499
Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys Ser Asn Ser Gly Asn
305 310 315 320

GAC CTA GAA GAG TGC TTG AAA TTT TTG AAT TTC TTC AAG GAC AAT ACA 1547
Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr
325 330 335

TGT CTT AAA AAT GCA ATT CAA GCC TTT GGC AAT GGC TCC GAT GTG ACC 1595
Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr
340 345 350

GTG TGG CAG CCA GCC TTC CCA GTA CAG ACC ACC ACT GCC ACT ACC ACC 1643
Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr Thr Ala Thr Thr Thr
355 360 365

ACT GCC CTC CGG GTT AAG AAC AAG CCC CTG GGG CCA GCA GGG TCT GAG 1691
Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu
370 375 380

AAT GAA ATT CCC ACT CAT GTT TTG CCA CCG TGT GCA AAT TTA CAG GCA 1739
Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala
385 390 395 400

CAG AAG CTG AAA TCC AAT GTG TCG GGC AAT ACA CAC CTC TGT ATT TCC 1787
Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr His Leu Cys Ile Ser
405 410 415

AAT GGT AAT TAT GAA AAA GAA GGT CTC GGT GCT TCC AGC CAC ATA ACC 1835
Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser His Ile Thr
420 425 430

ACA AAA TCA ATG GCT GCT CCT CCA AGC TGT GGT CTG AGC CCA CTG CTG 1883
Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro Leu Leu
435 440 445

GTC CTG GTG GTA ACC GCT CTG TCC ACC CTA TTA TCT TTA ACA GAA ACA 1931
Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr Glu Thr
450 455 460

TCA TAGCTGCATT AAAAAAATAC AATATGGACA TGTAAAAAGA CAAAAACCAA 1984
Ser
465

GTTATCTGTT TCCTGTTCTC TTGTATAGCT GAAATTCCAG TTTAGGAGCT CAGTTGAGAA 2044

ACAGTTCCAT TCAACTGGAA CATTTTTTTT TTTNCCTTTT AAGAAAGCTT CTTGTGATCC 2104

TTNGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC AAACTCAGAA GGCTTTGGGA 2164

TATGCTGTAT TTTAAAGGGA CAGTTTGTAA CTTGGGCTGT AAAGCAAACT GGGGCTGTGT 2224

TTTCGATGAT GATGATNATC ATGATNATGA TNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2284

NNNNNNNNNN GATTTTAACA GTTTTACTTC TGGCCTTTCC TAGCTAGAGA AGGAGTTAAT 2344

ATTTCTAAGG TAACTCCCAT ATCTCCTTTA ATGACATTGA TTTCTAATGA TATAAATTTC 2404

AGCCTACATT GATGCCAAGC TTTTTTGCCA CAAAGAAGAT TCTTACCAAG AGTGGGCTTT 2464

GTGGAAACAG CTGGTACTGA TGTTCACCTT TATATATGTA CTAGCATTTT CCACGCTGAT 2524

GTTTATGTAC TGTAAACAGT TCTGCACTCT TGTACAAAAG AAAA 2568

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 465 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Phe Leu Ala Thr Leu Tyr Phe Ala Leu Pro Leu Leu Asp Leu Leu
1 5 10 15

Leu Ser Ala Glu Val Ser Gly Gly Asp Arg Leu Asp Cys Val Lys Ala
20 25 30

Ser Asp Gln Cys Leu Lys Glu Gln Ser Cys Ser Thr Lys Tyr Arg Thr
35 40 45

Leu Arg Gln Cys Val Ala Gly Lys Glu Thr Asn Phe Ser Leu Ala Ser
50 55 60

Gly Leu Glu Ala Lys Asp Glu Cys Arg Ser Ala Met Glu Ala Leu Lys
65 70 75 80

Gln Lys Ser Leu Tyr Asn Cys Arg Cys Lys Arg Gly Met Lys Lys Glu
85 90 95

Lys Asn Cys Leu Arg Ile Tyr Trp Ser Met Tyr Gln Ser Leu Gln Gly
100 105 110

Asn Asp Leu Leu Glu Asp Ser Pro Tyr Glu Pro Val Asn Ser Arg Leu
115 120 125

Ser Asp Ile Phe Arg Val Val Pro Phe Ile Ser Asp Val Phe Gln Gln
130 135 140

Val Glu His Ile Pro Lys Gly Asn Asn Cys Leu Asp Ala Ala Lys Ala
145 150 155 160

Cys Asn Leu Asp Asp Ile Cys Lys Lys Tyr Arg Ser Ala Tyr Ile Thr
165 170 175

Pro Cys Thr Thr Ser Val Ser Asn Asp Val Cys Asn Arg Arg Lys Cys
180 185 190

His Lys Ala Leu Arg Gln Phe Phe Asp Lys Val Pro Ala Lys His Ser
195 200 205

Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp Ile Ala Cys Thr Glu Arg
210 215 220

Arg Arg Gln Thr Ile Val Pro Val Cys Ser Tyr Glu Glu Arg Glu Lys
225 230 235 240

Pro Asn Cys Leu Asn Leu Gln Asp Ser Cys Lys Thr Asn Tyr Ile Cys
245 250 255

Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu Ser Arg
260 265 270

Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu Leu Ala
275 280 285

Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro Asn Tyr Ile Asp Ser
290 295 300

Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys Ser Asn Ser Gly Asn
305 310 315 320

Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr
325 330 335

Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr
340 345 350

Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr Thr Ala Thr Thr Thr
355 360 365

Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu
370 375 380

Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala
385 390 395 400

Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr His Leu Cys Ile Ser
405 410 415

Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser His Ile Thr
420 425 430

Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro Leu Leu
435 440 445

Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr Glu Thr
450 455 460

Ser
465

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2138 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 302..1705

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
AGCTCGCTCT CCCGGGGCAG TGGTGTGGAT GCACCGGAGT TCGGGCGCTG GGCAAGTTGG 
GTCGGAACTG AACCCCTGAA AGCGGGTCCG CCTCCCGCCC TCGCGCCCGC CCGGATCTGA 
GTCGCTGGCG GCGGTGGGCG GCAGAGCGAC GGGGAGTCTG CTCTCACCCT GGATGGAGCT 
GAACTTTGAG TGGCCAGAGG AGCGCAGTCG CCCGGGGATC GCTGCACGCT GAGCTCTCTC 
CCCGAGACCG GGCGGCGGCT TTGGATTTTG GGGGGGCGGG GACCAGCTGC GCGGCGGCAC 

C Met Z S £ £ £ 2 « ? TC «* « «■ - ™ 

a mr i* u Tyr Phe Ala Leu Pro Leu Leu Asp Leu 
5 10 15 

2 2 = S 2 2 2 5 5 5 - 2 2 - s £ 
2 2 2 2 2 2 2 £ £ 2 2 2 2 2 2 
2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 
=5=2222222222222 

70 ?5 

2 2 2 2 2 2 2 2 " "° »» 

80 « ASn CyS Arg <*» L ^ s Arg Gly Met Lys Lys 

90 

=£===22222222222 

105 no 

G^ £n f C ? ^ GAA ^ TCC CCG TAT ^G CCG GTT AAC AGC AGG 

Gly Asn Asp Leu Leu Giu Asp Ser Pro ^ Glu ^ ™ £0 AGC AGG 

120 125 

S S Asp s: S 2; s v G Ij ? G Ih° r TCA GAT GTT TTC <*» 

e roe Arg Ala Val Pro Phe He Ser Asp Val Phe Gin 
135 140 

2 2 2 22 2 2 2 r r* wc CT0 "* ~ « « 

14 5 J*! V ASn Asn Cys Leu As P Ala A la Lys 

150 155 



60 
120 
180 
240 
300 
346 

394 

442 

490 

538 



586 



634 



€62 



730

GCC TGC AAC CTG GAC GAC ACC TGT AAG AAG TAC AGG TCG GCC TAC ATC 826
Ala Cys Asn Leu Asp Asp Thr Cys Lys Lys Tyr Arg Ser Ala Tyr Ile
160 165 170 175

ACC CCC TGC ACC ACC AGC ATG TCC AAC GAG GTC TGC AAC CGC CGT AAG 874
Thr Pro Cys Thr Thr Ser Met Ser Asn Glu Val Cys Asn Arg Arg Lys
180 185 190

TGC CAC AAG GCC CTC AGG CAG TTC TTC GAC AAG GTT CCG GCC AAG CAC 922
Cys His Lys Ala Leu Arg Gln Phe Phe Asp Lys Val Pro Ala Lys His
195 200 205

AGC TAC GGG ATG CTC TTC TGC TCC TGC CGG GAC ATC GCC TGC ACC GAG 970
Ser Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp Ile Ala Cys Thr Glu
210 215 220

CGG CGG CGA CAG ACT ATC GTC CCC GTG TGC TCC TAT GAA GAA CGA GAG 1018
Arg Arg Arg Gln Thr Ile Val Pro Val Cys Ser Tyr Glu Glu Arg Glu
225 230 235

AGG CCC AAC TGC CTG AGT CTG CAA GAC TCC TGC AAG ACC AAT TAC ATC 1066
Arg Pro Asn Cys Leu Ser Leu Gln Asp Ser Cys Lys Thr Asn Tyr Ile
240 245 250 255

TGC AGA TCT CGC CTT GCA GAT TTT TTT ACC AAC TGC CAG CCA GAG TCA 1114
Cys Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu Ser
260 265 270

AGG TCT GTC AGC AAC TGT CTT AAG GAG AAC TAC GCA GAC TGC CTC CTG 1162
Arg Ser Val Ser Asn Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu Leu
275 280 285

GCC TAC TCG GGA CTG ATT GGC ACA GTC ATG ACT CCC AAC TAC GTA GAC 1210
Ala Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro Asn Tyr Val Asp
290 295 300

TCC AGC AGC CTC AGC GTG GCA CCA TGG TGT GAC TGC AGC AAC AGC GGC 1258
Ser Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys Ser Asn Ser Gly
305 310 315

AAT GAC CTG GAA GAC TGC TTG AAA TTT CTG AAT TTT TTT AAG GAC AAT 1306
Asn Asp Leu Glu Asp Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn
320 325 330 335

ACT TGT CTC AAA AAT GCA ATT CAA GCC TTT GGC AAT GGC TCA GAT GTG 1354
Thr Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val
340 345 350

ACC ATG TGG CAG CCA GCC CCT CCA GTC CAG ACC ACC ACT GCC ACC ACT 1402
Thr Met Trp Gln Pro Ala Pro Pro Val Gln Thr Thr Thr Ala Thr Thr
355 360 365

ACC ACT GCC TTC CGG GTC AAG AAC AAG CCT CTG GGG CCA GCA GGG TCT 1450
Thr Thr Ala Phe Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser
370 375 380

GAG AAT GAG ATC CCC ACA CAC GTT TTA CCA CCC TGT GCG AAT TTG CAG 1498
Glu Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln
385 390 395

GCT CAG AAG CTG AAA TCC AAT GTG TCG GGT AGC ACA CAC CTC TGT CTT 1546
Ala Gln Lys Leu Lys Ser Asn Val Ser Gly Ser Thr His Leu Cys Leu
400 405 410 415

TCT GAT AGT GAT TTC GGA AAG GAT GGT CTC GCT GGT GCC TCC AGC CAC 1594
Ser Asp Ser Asp Phe Gly Lys Asp Gly Leu Ala Gly Ala Ser Ser His
420 425 430

ATA ACC ACA AAA TCA ATG GCT GCT CCT CCC AGC TGC AGT CTG AGC TCA 1642
Ile Thr Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Ser Leu Ser Ser
435 440 445

CTG CCG GTG CTG ATG CTC ACC GCC CTT GCT GCC CTG TTA TCT GTA TCG 1690
Leu Pro Val Leu Met Leu Thr Ala Leu Ala Ala Leu Leu Ser Val Ser
450 455 460

TTG GCA GAA ACG TCG TAGCTGCATC CGGGAAAACA GTATGAAAAG ACAAAAGAGA 1745
Leu Ala Glu Thr Ser
465

ACCAAGTATT CTGTCCCTGT CCTCTTGTAT ATCTGAAAAT CCAGTTTTAA AAGCTCCGTT 1805

GAGAAGCAGT TTCACCCAAC TGGAACTCTT TCCTTGTTTT TAAGAAAGCT TGTGGCCCTC 1865

AGGGGCTTCT GTTGAAGAAC TGCTACAGGG CTAATTCCAA ACCCATAAGG CTCTGGGGCG 1925

TGGTGCGGCT TAAGGGGACC ATTTGCACCA TGTAAAGCAA GCTGGGCTTA TCATGTGTTT 1985

GATGGTGAGG ATGGTAGTGG TGATGATGAT GGTAATTTTA ACAGCTTGAA CCCTGTTCTC 2045

TCTACTGGTT AGGAACAGGA GATACTATTG ATAAAGATTC TTCCATGTCT TACTCAGCAG 2105

CATTGCCTTC TGAAGACAGG CCCGCAGCCG TCG 2138

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 468 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Met Phe Leu Ala Thr Leu Tyr Phe Ala Leu Pro Leu Leu Asp Leu Leu
1 5 10 15

Met Ser Ala Glu Val Ser Gly Gly Asp Arg Leu Asp Cys Val Lys Ala
20 25 30

Ser Asp Gln Cys Leu Lys Glu Gln Ser Cys Ser Thr Lys Tyr Arg Thr
35 40 45

Leu Arg Gln Cys Val Ala Gly Lys Glu Thr Asn Phe Ser Leu Thr Ser
50 55 60

Gly Leu Glu Ala Lys Asp Glu Cys Arg Ser Ala Met Glu Ala Leu Lys
65 70 75 80

Gln Lys Ser Leu Tyr Asn Cys Arg Cys Lys Arg Gly Met Lys Lys Glu
85 90 95

Lys Asn Cys Leu Arg Ile Tyr Trp Ser Met Tyr Gln Ser Leu Gln Gly
100 105 110

Asn Asp Leu Leu Glu Asp Ser Pro Tyr Glu Pro Val Asn Ser Arg Leu
115 120 125

Ser Asp Ile Phe Arg Ala Val Pro Phe Ile Ser Asp Val Phe Gln Gln
130 135 140

Val Glu His Ile Ser Lys Gly Asn Asn Cys Leu Asp Ala Ala Lys Ala
145 150 155 160

Cys Asn Leu Asp Asp Thr Cys Lys Lys Tyr Arg Ser Ala Tyr Ile Thr
165 170 175

Pro Cys Thr Thr Ser Met Ser Asn Glu Val Cys Asn Arg Arg Lys Cys
180 185 190

His Lys Ala Leu Arg Gln Phe Phe Asp Lys Val Pro Ala Lys His Ser
195 200 205

Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp Ile Ala Cys Thr Glu Arg
210 215 220

Arg Arg Gln Thr Ile Val Pro Val Cys Ser Tyr Glu Glu Arg Glu Arg
225 230 235 240

Pro Asn Cys Leu Ser Leu Gln Asp Ser Cys Lys Thr Asn Tyr Ile Cys
245 250 255

Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu Ser Arg
260 265 270

Ser Val Ser Asn Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu Leu Ala
275 280 285

Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro Asn Tyr Val Asp Ser
290 295 300

Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys Ser Asn Ser Gly Asn
305 310 315 320

Asp Leu Glu Asp Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr
325 330 335

Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr
340 345 350

Met Trp Gln Pro Ala Pro Pro Val Gln Thr Thr Thr Ala Thr Thr Thr
355 360 365

Thr Ala Phe Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu
370 375 380

Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala
385 390 395 400

Gln Lys Leu Lys Ser Asn Val Ser Gly Ser Thr His Leu Cys Leu Ser
405 410 415

Asp Ser Asp Phe Gly Lys Asp Gly Leu Ala Gly Ala Ser Ser His Ile
420 425 430

Thr Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Ser Leu Ser Ser Leu
435 440 445

Pro Val Leu Met Leu Thr Ala Leu Ala Ala Leu Leu Ser Val Ser Leu
450 455 460

Ala Glu Thr Ser
465

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3209 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..539

(D) OTHER INFORMATION: /note= "1 to 539 is -237 to 301 of
Figure 5 Gdnfr"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 540..1937

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AATCTGGCCT CGGAACACGC CATTCTCCGC GCCGCTTCCA ATAACCACTA ACATCCCTAA 60

CGAGCATCCG AGCCGAGGGC TCTGCTCGGA AATCGTCCTG GCCCAACTCG GCCCTTCGAG 120

CTCTCGAAGA TTACCGCATC TATTTTTTTT TTCTTTTTTT TCTTTTCCTA GCGCAGATAA 180

AGTGAGCCCG GAAAGGGAAG GAGGGGGCGG GGACACCATT GCCCTGAAAG AATAAATAAG 240

TAAATAAACA AACTGGCTCC TCGCCGCAGC TGGACGCGGT CGGTTGAGTC CAGGTTGGGT 300

CGGACCTGAA CCCCTAAAAG CGGAACCGCC TCCCGCCCTC GCCATCCCGG AGCTGAGTCG 360

CCGGCGGCGG TGGCTGCTGC CAGACCCGGA GTTTCCTCTT TCACTGGATG GAGCTGAACT 420

TTGGGCGGCC AGAGCAGCAC AGCTGTCCGG GGATCGCTGC ACGCTGAGCT CCCTCGGCAA 480

GACCCAGCGG CGGCTCGGGA TTTTTTTGGG GGGGCGGGGA CCAGCCCCGC GCCGGCACC 539

ATG TTC CTG GCG ACC CTG TAC TTC GCG CTG CCG CTC TTG GAC TTG CTC 587
Met Phe Leu Ala Thr Leu Tyr Phe Ala Leu Pro Leu Leu Asp Leu Leu
1 5 10 15

CTG TCG GCC GAA GTG AGC GGC GGA GAC CGC CTG GAT TGC GTG AAA GCC 635
Leu Ser Ala Glu Val Ser Gly Gly Asp Arg Leu Asp Cys Val Lys Ala
20 25 30

AGT GAT CAG TGC CTG AAG GAG CAG AGC TGC AGC ACC AAG TAC CGC ACG 683
Ser Asp Gln Cys Leu Lys Glu Gln Ser Cys Ser Thr Lys Tyr Arg Thr
35 40 45

CTA AGG CAG TGC GTG GCG GGC AAG GAG ACC AAC TTC AGC CTG GCA TCC 731
Leu Arg Gln Cys Val Ala Gly Lys Glu Thr Asn Phe Ser Leu Ala Ser
50 55 60

GGC CTG GAG GCC AAG GAT GAG TGC CGC AGC GCC ATG GAG GCC CTG AAG 779
Gly Leu Glu Ala Lys Asp Glu Cys Arg Ser Ala Met Glu Ala Leu Lys
65 70 75 80

CAG AAG TCG CTC TAC AAC TGC CGC TGC AAG CGG GGT ATG AAG AAG GAG 827
Gln Lys Ser Leu Tyr Asn Cys Arg Cys Lys Arg Gly Met Lys Lys Glu
85 90 95

AAG AAC TGC CTG CGC ATT TAC TGG AGC ATG TAC CAG AGC CTG CAG GGA 875
Lys Asn Cys Leu Arg Ile Tyr Trp Ser Met Tyr Gln Ser Leu Gln Gly
100 105 110

AAT GAT CTG CTG GAG GAT TCC CCA TAT GAA CCA GTT AAC AGC AGA TTG 923
Asn Asp Leu Leu Glu Asp Ser Pro Tyr Glu Pro Val Asn Ser Arg Leu
115 120 125

TCA GAT ATA TTC CGG GTG GTC CCA TTC ATA TCA GAT GTT TTT CAG CAA 971
Ser Asp Ile Phe Arg Val Val Pro Phe Ile Ser Asp Val Phe Gln Gln
130 135 140

GTG GAG CAC ATT CCC AAA GGG AAC AAC TGC CTG GAT GCA GCG AAG GCC 1019
Val Glu His Ile Pro Lys Gly Asn Asn Cys Leu Asp Ala Ala Lys Ala
145 150 155 160

TGC AAC CTC GAC GAC ATT TGC AAG AAG TAC AGG TCG GCG TAC ATC ACC 1067
Cys Asn Leu Asp Asp Ile Cys Lys Lys Tyr Arg Ser Ala Tyr Ile Thr
165 170 175

CCG TGC ACC ACC AGC GTG TCC AAN GAT GTC TGC AAC CGC CGC AAG TGC 1115
Pro Cys Thr Thr Ser Val Ser Xaa Asp Val Cys Asn Arg Arg Lys Cys
180 185 190

CAC AAG GCC CTC CGG CAG TTC TTT GAC AAG GTC CCG GCC AAG CAC AGC 1163
His Lys Ala Leu Arg Gln Phe Phe Asp Lys Val Pro Ala Lys His Ser
195 200 205

TAC GGA ATG CTC TTC TGC TCC TGC CGG GAC ATC GCC TGC ACA GAG CGG 1211
Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp Ile Ala Cys Thr Glu Arg
210 215 220

AGG CGA CAG ACC ATC GTG CCT GTG TGC TCC TAT GAA GAG AGG GAG AAG 1259
Arg Arg Gln Thr Ile Val Pro Val Cys Ser Tyr Glu Glu Arg Glu Lys
225 230 235 240

CCC AAC TGT TTG AAT TTG CAG GAC TCC TGC AAG ACG AAT TAC ATC TGC 1307
Pro Asn Cys Leu Asn Leu Gln Asp Ser Cys Lys Thr Asn Tyr Ile Cys
245 250 255

AGA TCT CGC CTT GCG GAT TTT TTT ACC AAC TGC CAG CCA GAG TCA AGG 1355
Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu Ser Arg
260 265 270

TCT GTC AGC AGC TGT CTA AAG GAA AAC TAC GCT GAC TGC CTC CTC GCC 1403
Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu Leu Ala
275 280 285

TAC TCG GGG CTT ATT GGC ACA GTC ATG ACC CCC AAC TAC ATA GAC TCC 1451
Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro Asn Tyr Ile Asp Ser
290 295 300

AGT AGC CTC AGT GTG GCC CCA TGG TGT GAC TGC AGC AAC AGT GGG AAC 1499
Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys Ser Asn Ser Gly Asn
305 310 315 320

GAC CTA GAA GAG TGC TTG AAA TTT TTG AAT TTC TTC AAG GAC AAT ACA 1547
Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr
325 330 335

TGT CTT AAA AAT GCA ATT CAA GCC TTT GGC AAT GGC TCC GAT GTG ACC 1595
Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr
340 345 350

GTG TGG CAG CCA GCC TTC CCA GTA CAG ACC ACC ACT GCC ACT ACC ACC 1643
Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr Thr Ala Thr Thr Thr
355 360 365

ACT GCC CTC CGG GTT AAG AAC AAG CCC CTG GGG CCA GCA GGG TCT GAG 1691
Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu
370 375 380

AAT GAA ATT CCC ACT CAT GTT TTG CCA CCG TGT GCA AAT TTA CAG GCA 1739
Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala
385 390 395 400

CAG AAG CTG AAA TCC AAT GTG TCG GGC AAT ACA CAC CTC TGT ATT TCC 1787
Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr His Leu Cys Ile Ser
405 410 415

AAT GGT AAT TAT GAA AAA GAA GGT CTC GGT GCT TCC AGC CAC ATA ACC 1835
Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser His Ile Thr
420 425 430

ACA AAA TCA ATG GCT GCT CCT CCA AGC TGT GGT CTG AGC CCA CTG CTG 1883
Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro Leu Leu
435 440 445

GTC CTG GTG GTA ACC GCT CTG TCC ACC CTA TTA TCT TTA ACA GAA ACA 1931
Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr Glu Thr
450 455 460

TCA TAG CTGCATTAAA AAAATACAAT ATGGACATGT AAAAAGACAA AAACCAAGTT 1987
Ser *
465

ATCTGTTTCC TGTTCTCTTG TATAGCTGAA ATTCCAGTTT AGGAGCTCAG TTGAGAAACA 2047

GTTCCATTCA ACTGGAACAT TTTTTTTTTT NCCTTTTAAG AAAGCTTCTT GTGATCCTTC 2107 

GGGGCTTCTG TGAAAAACCT GATGCAGTGC TCCATCCAAA CTCAGAAGGC TTTGGGATAT 2167 

GCTGTATTTT AAAGGGACAG TTTGTAACTT GGGCTGTAAA GCAAACTGGG GCTGTGTTTT 2227 

CGATGATGAT GATCATCATG ATCATGATNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2287 

NNNNNNNGAT TTTAACAGTT TTACTTCTGG CCTTTCCTAG CTAGAGAAGG AGTTAATATT 2347 

TCTAAGGTAA CTCCCATATC TCCTTTAATG ACATTGATTT CTAATGATAT AAATTTCAGC 2407 

CTACATTGAT GCCAAGCTTT TTTGCCACAA AGAAGATTCT TACCAAGAGT GGGCTTTGTG 2467 

GAAACAGCTG GTACTGATGT TCACCTTTAT ATATGTACTA GCATTTTCCA CGCTGATGTT 2527 

TATGTACTGT AAACAGTTCT GCACTCTTGT ACAAAAGAAA AAACACCTGT CACATCCAAA 2587 

TATAGTATCT GTCTTTTCGT CAAAATAGAG AGTGGGGAAT GAGTGTGCCG ATTCAATACC 2647 

TCAATCCCTG AACGACACTC TCCTAATCCT AAGCCTTACC TGAGTGAGAA GCCCTTTACC 2707 

TAACAAAAGT CCAATATAGC TGAAATGTCG CTCTAATACT CTTTACACAT ATGAGGTTAT 2767 

ATGTAGAAAA AAATTTTACT ACTAAATGAT TTCAACTATT GGCTTTCTAT ATTTTGAAAG 2827 

TAATGATATT GTCTCATTTT TTTACTGATG GTTTAATACA AAATACACAG AGCTTGTTTC 2887

CCCTCATAAG TAGTGTTCGC TCTGATATGA ACTTCACAAA TACAGCTCAT CAAAAGCAGA 2947 

CTCTGAGAAG CCTCGTGCTG TAGCAGAAAG TTCTGCATCA TGTGACTGTG GACAGGCAGG 3007 

AGGAAACAGA ACAGACAAGC ATTGTCTTTT GTCATTGCTC GAAGTGCAAG CGTGCATACC 3067 
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W 104 W 

TGTGGAGGGA ACTGGTGGCT GCTTGTAAAT GTTCTGCAGC ATCTCTTGAC ACACTTGTCA 3127 
TGACACAATC CAGTACCTTG GTTTTCAGGT TATCTGACAA AGGCAGCTTT GATTGGGACA 3187 
TGGAGGCATG GGCAGGCCGG AA 



(2) INFORMATION FOR SEQ ID NO : 6 : 

<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 466 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Phe Leu Ala Thr Leu Tyr Phe Ala Leu Pro Leu Leu Asp Leu Leu 
15 10 is 

Leu Ser Ala Glu Val Ser Gly Gly Asp Arg Leu Asp Cys Val Lys Ala 
20 25 30 

Ser Asp Gin Cys Leu Lys Glu Gin Ser Cys Ser Thr Lys Tyr Arg Thr 
35 40 45 

Leu Arg Gin Cys Val Ala Gly Lys Glu Thr Asn Phe Ser Leu Ala Ser 
50 55 60 

Gly Leu Glu Ala Lys Asp Glu Cys Arg Ser Ala Met Glu Ala Leu Lys 
65 70 75 80 

Gin Lys Ser Leu Tyr Asn Cys Arg Cys Lys Arg Gly Met Lys Lys Glu 
85 90 95 

Lys Asn Cys Leu Arg He Tyr Trp Ser Met Tyr Gin Ser Leu Gin Gly 
100 105 no 

Asn Asp Leu Leu Glu Asp Ser Pro Tyr Glu Pro Val Asn Ser Arg Leu 
115 120 125 

Ser Asp He Phe Arg Val Val Pro Phe He Ser Asp Val Phe Gin Gin 
130 135 140 

Val Glu His He Pro Lys Gly Asn Asn Cys Leu Asp Ala Ala Lys Ala 
145 150 155 160 

Cys Asn Leu Asp Asp He Cys Lys Lys Tyr Arg Ser Ala Tyr He Thr 
165 170 175 

Pro Cys Thr Thr Ser Val Ser Xaa Asp Val Cys Asn Arg Arg Lys Cys 
180 185 190 

His Lys Ala Leu Arg Gin Phe Phe Asp Lys Val Pro Ala Lys His Ser 
195 200 205 



Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp Ile Ala Cys Thr Glu Arg
210 215 220

Arg Arg Gln Thr Ile Val Pro Val Cys Ser Tyr Glu Glu Arg Glu Lys
225 230 235 240

Pro Asn Cys Leu Asn Leu Gln Asp Ser Cys Lys Thr Asn Tyr Ile Cys
245 250 255

Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu Ser Arg
260 265 270

Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu Leu Ala
275 280 285

Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro Asn Tyr Ile Asp Ser
290 295 300

Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys Ser Asn Ser Gly Asn
305 310 315 320

Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr
325 330 335

Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr
340 345 350

Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr Thr Ala Thr Thr Thr
355 360 365

Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu
370 375 380

Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala
385 390 395 400

Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr His Leu Cys Ile Ser
405 410 415

Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser His Ile Thr
420 425 430

Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro Leu Leu 
435 440 445 

Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr Glu Thr 
450 455 460 



Ser * 
465

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..508

(D) OTHER INFORMATION: /note= "1 to 508 is -237 to 272 of
Figure 5 Hsgr-21af"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

TCTGGCCTCG GAACACGCCA TTCTCCGCGC CGCTTCCAAT AACCACTAAC ATCCCTAACG 60 

AGCATCCGAG CCGAGGGCTC TGCTCGGAAA TCGTCCTGGC CCAACTCGGC CCTTCGAGCT 120 

CTCGAAGATT ACCGCATCTA TTTTTTTTTT CTTTTTTTTC TTTTCCTAGC GCAGATAAAG 180 

TGAGCCCGGA AAGGGAAGGA GGGGGCGGGG ACACCATTGC CCTGAAAGAA TAAATAAGTA 240 

AATAAACAAA CTGGCTCCTC GCCGCAGCTG GACGCGGTCG GTTGAGTCCA GGTTGGGTCG 300 

GACCTGAACC CCTAAAAGCG GAACCGCCTC CCGCCCTCGC CATCCCGGAG CTGAGTCGCC 360 

GGCGGCGGTG GCTGCTGCCA GACCCGGAGT TTCCTCTTTC ACTGGATGGA GCTGAACTTT 420 

GGGCGGCCAG AGCAGCACAG CTGTCCGGGG ATCGCTGCAC GCTGAGCTCC CTCGGCAAGA 480 

CCCAGCGGCG GCTCGGGATT TTTTTGGG 508 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..510

(D) OTHER INFORMATION: /note= "1 to 510 is -237 to 272 of
Figure 5 Hsgr-21bf"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



AATCTGGCCT CGGAACACGC CATTCTCCGC GCCGCTTCCA ATAACCACTA ACATCCCTAA 60

CGAGCATCCG AGCCGAGGGC TCTGCTCGGA AATCGTCCTG GCCCAACTCG GCCCTTCGAG 120

CTCTCGAAGA TTACCGCATC TATTTTTTTT TTCTTTTTTT TCTTTTCCTA GCGCAGATAA 180

AGTGAGCCCG GAAAGGGAAG GAGGGGGCGG GGACACCATT GCCCTGAAAG AATAAATAAG 240

TAAATAAACA AACTGGCTCC TCGCCGCAGC TGGACGCGGT CGGTTGAGTC CAGGTTGGGT 300

CGGACCTGAA CCCCTAAAAG CGGAACCGCC TCCCGCCCTC GCCATCCCGG AGCTGAGTCG 360

CCGGCGGCGG TGGCTGCTGC CAGACCCGGA GTTTCCTCTT TCACTGGATG GAGCTGAACT 420

TTGGGCGGCC AGAGCAGCAC AGCTGTCCGG GGATCGCTGC ACGCTGAGCT CCCTCGGCAA 480

GACCCAGCGG CGGCTCGGGA TTTTTTTGGG 510

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1927 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 538..1926

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..537

(D) OTHER INFORMATION: /note= "1 to 537 is -235 to 301 of
Figure 5 21acon"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

TCTGGCCTCG GAACACGCCA TTCTCCGCGC CGCTTCCAAT AACCACTAAC ATCCCTAACG 60

AGCATCCGAG CCGAGGGCTC TGCTCGGAAA TCGTCCTGGC CCAACTCGGC CCTTCGAGCT 120

CTCGAAGATT ACCGCATCTA TTTTTTTTTT CTTTTTTTTC TTTTCCTAGC GCAGATAAAG 180

TGAGCCCGGA AAGGGAAGGA GGGGGCGGGG ACACCATTGC CCTGAAAGAA TAAATAAGTA 240

AATAAACAAA CTGGCTCCTC GCCGCAGCTG GACGCGGTCG GTTGAGTCCA GGTTGGGTCG 300

GACCTGAACC CCTAAAAGCG GAACCGCCTC CCGCCCTCGC CATCCCGGAG CTGAGTCGCC 360

GGCGGCGGTG GCTGCTGCCA GACCCGGAGT TTCCTCTTTC ACTGGATGGA GCTGAACTTT 420

GGGCGGCCAG AGCAGCACAG CTGTCCGGGG ATCGCTGCAC GCTGAGCTCC CTCGGCAAGA 480

CCCAGCGGCG GCTCGGGATT TTTTTGGGGG GGCGGGGACC AGCCCCGCGC CGGCACC 537



WO 97/40152 




PCT/US97/06281



ATG TTC CTG GCG NCC CTG TAC TTC GCG CTG CCG CTC TTG GAC TTG CTC 585
Met Phe Leu Ala Xaa Leu Tyr Phe Ala Leu Pro Leu Leu Asp Leu Leu
1 5 10 15

CTG TCG GCC GAA GTG AGC GGC GGA GAC CGC CTG GAT TGC GTG AAA GCC 633
Leu Ser Ala Glu Val Ser Gly Gly Asp Arg Leu Asp Cys Val Lys Ala
20 25 30

AGT GAT CAG TGC CTG AAG GAG CAG AGC TGC AGC ACC AAG TAC CGC ACG 681
Ser Asp Gln Cys Leu Lys Glu Gln Ser Cys Ser Thr Lys Tyr Arg Thr
35 40 45

CTA AGG CAG TGC GTG GCG GGC AAG GAG ACC AAC TTC AGC CTG GCA TCC 729
Leu Arg Gln Cys Val Ala Gly Lys Glu Thr Asn Phe Ser Leu Ala Ser
50 55 60

GGC CTG GAG GCC AAG GAT GAG TGC CGC AGC GCC ATG GAG GCC CTG AAG 777
Gly Leu Glu Ala Lys Asp Glu Cys Arg Ser Ala Met Glu Ala Leu Lys
65 70 75 80

CAG AAG TCG CTC TAC AAC TGC CGC TGC AAG CGG GGT ATG AAG AAG GAG 825
Gln Lys Ser Leu Tyr Asn Cys Arg Cys Lys Arg Gly Met Lys Lys Glu
85 90 95

AAG AAC TGC CTG CGC ATT TAC TGG AGC ATG TAC CAG AGC CTG CAG GGA 873
Lys Asn Cys Leu Arg Ile Tyr Trp Ser Met Tyr Gln Ser Leu Gln Gly
100 105 110

AAT GAT CTG CTG GAG GAT TCC CCA TAT GAA CCA GTT AAC AGC AGA TTG 921
Asn Asp Leu Leu Glu Asp Ser Pro Tyr Glu Pro Val Asn Ser Arg Leu
115 120 125

TCA GAT ATA TTC CGG GTG GTC CCA TTC ATA TCA GAT GTT TTT CAG CAA 969
Ser Asp Ile Phe Arg Val Val Pro Phe Ile Ser Asp Val Phe Gln Gln
130 135 140

GTG GAG CAC ATT CCC AAA GGG AAC AAC TGC CTG GAT GCA GCG AAG GCC 1017
Val Glu His Ile Pro Lys Gly Asn Asn Cys Leu Asp Ala Ala Lys Ala
145 150 155 160

TGC AAC CTC GAC GAC ATT TGC AAG AAG TAC AGG TCG GCG TAC ATC ACC 1065
Cys Asn Leu Asp Asp Ile Cys Lys Lys Tyr Arg Ser Ala Tyr Ile Thr
165 170 175

CCG TGC ACC ACC AGC GTG TCC AAC GAT GTC TGC AAC CGC CGC AAG TGC 1113
Pro Cys Thr Thr Ser Val Ser Asn Asp Val Cys Asn Arg Arg Lys Cys
180 185 190

CAC AAG GCC CTC CGG CAG TTC TTT GAC AAG GTC CCG GCC AAG CAC AGC 1161
His Lys Ala Leu Arg Gln Phe Phe Asp Lys Val Pro Ala Lys His Ser
195 200 205

TAC GGA ATG CTC TTC TGC TCC TGC CGG GAC ATC GCC TGC ACA GAG CGG 1209
Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp Ile Ala Cys Thr Glu Arg
210 215 220

AGG CGA CAG ACC ATC GTG CCT GTG TGC TCC TAT GAA GAG AGG GAG AAG 1257
Arg Arg Gln Thr Ile Val Pro Val Cys Ser Tyr Glu Glu Arg Glu Lys
225 230 235 240

CCC AAC TGT TTG AAT TTG CAG GAC TCC TGC AAG ACG AAT TAC ATC TGC 1305
Pro Asn Cys Leu Asn Leu Gln Asp Ser Cys Lys Thr Asn Tyr Ile Cys
245 250 255

AGA TCT CGC CTT GCG GAT TTT TTT ACC AAC TGC CAG CCA GAG TCA AGG 1353
Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu Ser Arg
260 265 270

TCT GTC AGC AGC TGT CTA AAG GAA AAC TAC GCT GAC TGC CTC CTC GCC 1401
Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu Leu Ala
275 280 285

TAC TCG GGG CTT ATT GGC ACA GTC ATG ACC CCC AAC TAC ATA GAC TCC 1449
Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro Asn Tyr Ile Asp Ser
290 295 300

AGT AGC CTC AGT GTG GCC CCA TGG TGT GAC TGC AGC AAC AGT GGG AAC 1497
Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys Ser Asn Ser Gly Asn
305 310 315 320

GAC CTA GAA GAG TGC TTG AAA TTT TTG AAT TTC TTC AAG GAC AAT ACA 1545
Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr
325 330 335

TGT CTT AAA AAT GCA ATT CAA GCC TTT GGC AAT GGC TCC GAT GTG ACC 1593
Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr
340 345 350

GTG TGG CAG CCA GCC TTC CCA GTA CAG ACC ACC ACT GCC ACT ACC ACC 1641
Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr Thr Ala Thr Thr Thr
355 360 365

ACT GCC CTC CGG GTT AAG AAC AAG CCC CTG GGG CCA GCA GGG TCT GAG 1689
Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu
370 375 380

AAT GAA ATT CCC ACT CAT GTT TTG CCA CCG TGT GCA AAT TTA CAG GCA 1737
Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala
385 390 395 400

CAG AAG CTG AAA TCC AAT GTG TCG GGC AAT ACA CAC CTC TGT ATT TCC 1785
Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr His Leu Cys Ile Ser
405 410 415

AAT GGT AAT TAT GAA AAA GAA GGT CTC GGT GCT TCC AGC CAC ATA ACC 1833
Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser His Ile Thr
420 425 430

ACA AAA TCA ATG GCT GCT CCT CCA AGC TGT GGT CTG AGC CCA CTG CTG 1881
Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro Leu Leu
435 440 445

GTC CTG GTG GTA ACC GCT CTG TCC ACC CTA TTA TCT TTA ACA GAA 1926
Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr Glu
450 455 460

A 1927 



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Phe Leu Ala Xaa Leu Tyr Phe Ala Leu Pro Leu Leu Asp Leu Leu 
1 5 10 15

Leu Ser Ala Glu Val Ser Gly Gly Asp Arg Leu Asp Cys Val Lys Ala
20 25 30

Ser Asp Gln Cys Leu Lys Glu Gln Ser Cys Ser Thr Lys Tyr Arg Thr
35 40 45

Leu Arg Gln Cys Val Ala Gly Lys Glu Thr Asn Phe Ser Leu Ala Ser
50 55 60

Gly Leu Glu Ala Lys Asp Glu Cys Arg Ser Ala Met Glu Ala Leu Lys
65 70 75 80

Gln Lys Ser Leu Tyr Asn Cys Arg Cys Lys Arg Gly Met Lys Lys Glu
85 90 95

Lys Asn Cys Leu Arg Ile Tyr Trp Ser Met Tyr Gln Ser Leu Gln Gly
100 105 110

Asn Asp Leu Leu Glu Asp Ser Pro Tyr Glu Pro Val Asn Ser Arg Leu
115 120 125

Ser Asp Ile Phe Arg Val Val Pro Phe Ile Ser Asp Val Phe Gln Gln
130 135 140

Val Glu His Ile Pro Lys Gly Asn Asn Cys Leu Asp Ala Ala Lys Ala
145 150 155 160

Cys Asn Leu Asp Asp Ile Cys Lys Lys Tyr Arg Ser Ala Tyr Ile Thr
165 170 175

Pro Cys Thr Thr Ser Val Ser Asn Asp Val Cys Asn Arg Arg Lys Cys
180 185 190



His Lys Ala Leu Arg Gln Phe Phe Asp Lys Val Pro Ala Lys His Ser
195 200 205

Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp Ile Ala Cys Thr Glu Arg
210 215 220

Arg Arg Gln Thr Ile Val Pro Val Cys Ser Tyr Glu Glu Arg Glu Lys
225 230 235 240

Pro Asn Cys Leu Asn Leu Gln Asp Ser Cys Lys Thr Asn Tyr Ile Cys
245 250 255

Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu Ser Arg
260 265 270

Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu Leu Ala
275 280 285

Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro Asn Tyr Ile Asp Ser
290 295 300

Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys Ser Asn Ser Gly Asn
305 310 315 320

Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr
325 330 335

Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr
340 345 350

Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr Thr Ala Thr Thr Thr
355 360 365

Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu
370 375 380

Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala
385 390 395 400

Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr His Leu Cys Ile Ser
405 410 415

Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser His Ile Thr
420 425 430

Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro Leu Leu 
435 440 445 



Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr Glu 
450 455 460 



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1929 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 540..1928

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..539 

(D) OTHER INFORMATION: /note= "1 to 539 is -237 to 301 of
Figure 5 21bcon"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AATCTGGCCT CGGAACACGC CATTCTCCGC GCCGCTTCCA ATAACCACTA ACATCCCTAA 60 

CGAGCATCCG AGCCGAGGGC TCTGCTCGGA AATCGTCCTG GCCCAACTCG GCCCTTCGAG 120 

CTCTCGAAGA TTACCGCATC TATTTTTTTT TTCTTTTTTT TCTTTTCCTA GCGCAGATAA 180 

AGTGAGCCCG GAAAGGGAAG GAGGGGGCGG GGACACCATT GCCCTGAAAG AATAAATAAG 240 

TAAATAAACA AACTGGCTCC TCGCCGCAGC TGGACGCGGT CGGTTGAGTC CAGGTTGGGT 300 

CGGACCTGAA CCCCTAAAAG CGGAACCGCC TCCCGCCCTC GCCATCCCGG AGCTGAGTCG 360 

CCGGCGGCGG TGGCTGCTGC CAGACCCGGA GTTTCCTCTT TCACTGGATG GAGCTGAACT 420 

TTGGGCGGCC AGAGCAGCAC AGCTGTCCGG GGATCGCTGC ACGCTGAGCT CCCTCGGCAA 480 

GACCCAGCGG CGGCTCGGGA TTTTTTTGGG GGGGCGGGGA CCAGCCCCGC GCCGGCACC 539 

ATG TTC CTG GCG ACC CTG TAC TTC GCG CTG CCG CTC TTG GAC TTG CTC 587 
Met Phe Leu Ala Thr Leu Tyr Phe Ala Leu Pro Leu Leu Asp Leu Leu 
1 5 10 15

CTG TCG GCC GAA GTG AGC GGC GGA GAC CGC CTG GAT TGC GTG AAA GCC 635
Leu Ser Ala Glu Val Ser Gly Gly Asp Arg Leu Asp Cys Val Lys Ala
20 25 30

AGT GAT CAG TGC CTG AAG GAG CAG AGC TGC AGC ACC AAG TAC CGC ACG 683
Ser Asp Gln Cys Leu Lys Glu Gln Ser Cys Ser Thr Lys Tyr Arg Thr
35 40 45

CTA AGG CAG TGC GTG GCG GGC AAG GAG ACC AAC TTC AGC CTG GCA TCC 731
Leu Arg Gln Cys Val Ala Gly Lys Glu Thr Asn Phe Ser Leu Ala Ser
50 55 60

GGC CTG GAG GCC AAG GAT GAG TGC CGC AGC GCC ATG GAG GCC CTG AAG 779
Gly Leu Glu Ala Lys Asp Glu Cys Arg Ser Ala Met Glu Ala Leu Lys
65 70 75 80

CAG AAG TCG CTC TAC AAC TGC CGC TGC AAG CGG GGT ATG AAG AAG GAG 827
Gln Lys Ser Leu Tyr Asn Cys Arg Cys Lys Arg Gly Met Lys Lys Glu
85 90 95

AAG AAC TGC CTG CGC ATT TAC TGG AGC ATG TAC CAG AGC CTG CAG GGA 875
Lys Asn Cys Leu Arg Ile Tyr Trp Ser Met Tyr Gln Ser Leu Gln Gly
100 105 110

AAT GAT CTG CTG GAG GAT TCC CCA TAT GAA CCA GTT AAC AGC AGA TTG 923
Asn Asp Leu Leu Glu Asp Ser Pro Tyr Glu Pro Val Asn Ser Arg Leu
115 120 125

TCA GAT ATA TTC CGG GTG GTC CCA TTC ATA TCA GAT GTT TTT CAG CAA 971
Ser Asp Ile Phe Arg Val Val Pro Phe Ile Ser Asp Val Phe Gln Gln
130 135 140

GTG GAG CAC ATT CCC AAA GGG AAC AAC TGC CTG GAT GCA GCG AAG GCC 1019
Val Glu His Ile Pro Lys Gly Asn Asn Cys Leu Asp Ala Ala Lys Ala
145 150 155 160

TGC AAC CTC GAC GAC ATT TGC AAG AAG TAC AGG TCG GCG TAC ATC ACC 1067
Cys Asn Leu Asp Asp Ile Cys Lys Lys Tyr Arg Ser Ala Tyr Ile Thr
165 170 175

CCG TGC ACC ACC AGC GTG TCC AAC GAT GTC TGC AAC CGC CGC AAG TGC 1115
Pro Cys Thr Thr Ser Val Ser Asn Asp Val Cys Asn Arg Arg Lys Cys
180 185 190

CAC AAG GCC CTC CGG CAG TTC TTT GAC AAG GTC CCG GCC AAG CAC AGC 1163
His Lys Ala Leu Arg Gln Phe Phe Asp Lys Val Pro Ala Lys His Ser
195 200 205

TAC GGA ATG CTC TTC TGC TCC TGC CGG GAC ATC GCC TGC ACA GAG CGG 1211
Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp Ile Ala Cys Thr Glu Arg
210 215 220

AGG CGA CAG ACC ATC GTG CCT GTG TGC TCC TAT GAA GAG AGG GAG AAG 1259
Arg Arg Gln Thr Ile Val Pro Val Cys Ser Tyr Glu Glu Arg Glu Lys
225 230 235 240

CCC AAC TGT TTG AAT TTG CAG GAC TCC TGC AAG ACG AAT TAC ATC TGC 1307
Pro Asn Cys Leu Asn Leu Gln Asp Ser Cys Lys Thr Asn Tyr Ile Cys
245 250 255

AGA TCT CGC CTT GCG GAT TTT TTT ACC AAC TGC CAG CCA GAG TCA AGG 1355
Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu Ser Arg
260 265 270

TCT GTC AGC AGC TGT CTA AAG GAA AAC TAC GCT GAC TGC CTC CTC GCC 1403
Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu Leu Ala
275 280 285

TAC TCG GGG CTT ATT GGC ACA GTC ATG ACC CCC AAC TAC ATA GAC TCC 1451
Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro Asn Tyr Ile Asp Ser
290 295 300

AGT AGC CTC AGT GTG GCC CCA TGG TGT GAC TGC AGC AAC AGT GGG AAC 1499
Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys Ser Asn Ser Gly Asn
305 310 315 320

GAC CTA GAA GAG TGC TTG AAA TTT TTG AAT TTC TTC AAG GAC AAT ACA 1547
Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr
325 330 335

TGT CTT AAA AAT GCA ATT CAA GCC TTT GGC AAT GGC TCC GAT GTG ACC 1595
Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr
340 345 350

GTG TGG CAG CCA GCC TTC CCA GTA CAG ACC ACC ACT GCC ACT ACC ACC 1643
Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr Thr Ala Thr Thr Thr
355 360 365

ACT GCC CTC CGG GTT AAG AAC AAG CCC CTG GGG CCA GCA GGG TCT GAG 1691
Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu
370 375 380

AAT GAA ATT CCC ACT CAT GTT TTG CCA CCG TGT GCA AAT TTA CAG GCA 1739
Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala
385 390 395 400

CAG AAG CTG AAA TCC AAT GTG TCG GGC AAT ACA CAC CTC TGT ATT TCC 1787
Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr His Leu Cys Ile Ser
405 410 415

AAT GGT AAT TAT GAA AAA GAA GGT CTC GGT GCT TCC AGC CAC ATA ACC 1835
Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser His Ile Thr
420 425 430

ACA AAA TCA ATG GCT GCT CCT CCA AGC TGT GGT CTG AGC CCA CTG CTG 1883
Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro Leu Leu
435 440 445

GTC CTG GTG GTA ACC GCT CTG TCC ACC CTA TTA TCT TTA ACA GAA 1928
Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr Glu
450 455 460

A 1929 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Met Phe Leu Ala Thr Leu Tyr Phe Ala Leu Pro Leu Leu Asp Leu Leu
1 5 10 15

Leu Ser Ala Glu Val Ser Gly Gly Asp Arg Leu Asp Cys Val Lys Ala
20 25 30

Ser Asp Gln Cys Leu Lys Glu Gln Ser Cys Ser Thr Lys Tyr Arg Thr
35 40 45

Leu Arg Gln Cys Val Ala Gly Lys Glu Thr Asn Phe Ser Leu Ala Ser
50 55 60

Gly Leu Glu Ala Lys Asp Glu Cys Arg Ser Ala Met Glu Ala Leu Lys
65 70 75 80

Gln Lys Ser Leu Tyr Asn Cys Arg Cys Lys Arg Gly Met Lys Lys Glu
85 90 95

Lys Asn Cys Leu Arg Ile Tyr Trp Ser Met Tyr Gln Ser Leu Gln Gly
100 105 110

Asn Asp Leu Leu Glu Asp Ser Pro Tyr Glu Pro Val Asn Ser Arg Leu
115 120 125

Ser Asp Ile Phe Arg Val Val Pro Phe Ile Ser Asp Val Phe Gln Gln
130 135 140

Val Glu His Ile Pro Lys Gly Asn Asn Cys Leu Asp Ala Ala Lys Ala
145 150 155 160

Cys Asn Leu Asp Asp Ile Cys Lys Lys Tyr Arg Ser Ala Tyr Ile Thr
165 170 175

Pro Cys Thr Thr Ser Val Ser Asn Asp Val Cys Asn Arg Arg Lys Cys
180 185 190

His Lys Ala Leu Arg Gln Phe Phe Asp Lys Val Pro Ala Lys His Ser
195 200 205

Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp Ile Ala Cys Thr Glu Arg
210 215 220

Arg Arg Gln Thr Ile Val Pro Val Cys Ser Tyr Glu Glu Arg Glu Lys
225 230 235 240

Pro Asn Cys Leu Asn Leu Gln Asp Ser Cys Lys Thr Asn Tyr Ile Cys
245 250 255

Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu Ser Arg
260 265 270

Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu Leu Ala
275 280 285

Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro Asn Tyr Ile Asp Ser
290 295 300

Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys Ser Asn Ser Gly Asn
305 310 315 320

Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr
325 330 335

Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr
340 345 350

Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr Thr Ala Thr Thr Thr
355 360 365

Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu
370 375 380

Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala
385 390 395 400

Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr His Leu Cys Ile Ser
405 410 415

Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser His Ile Thr
420 425 430

Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro Leu Leu 
435 440 445 

Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr Glu 
450 455 460 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 699 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..699 

(D) OTHER INFORMATION: /note= "1 to 699 is 814 to 1512 of 
Figure 5 Hsgr-29a" 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..697

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



G TCG GCG TAC ATC ACC CCG TGC ACC ACC AGC GTG TCC AAT GAT GTC 46
Ser Ala Tyr Ile Thr Pro Cys Thr Thr Ser Val Ser Asn Asp Val
1 5 10 15

TGC AAC CGC CGC AAG TGC CAC AAG GCC CTC CGG CAG TTC TTT GAC AAG 94
Cys Asn Arg Arg Lys Cys His Lys Ala Leu Arg Gln Phe Phe Asp Lys
20 25 30

GTC CCG GCC AAG CAC AGC TAC GGA ATG CTC TTC TGC TCC TGC CGG GAC 142
Val Pro Ala Lys His Ser Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp
35 40 45

ATC GCC TGC ACA GAG CGG AGG CGA CAG ACC ATC GTG CCT GTG TGC TCC 190
Ile Ala Cys Thr Glu Arg Arg Arg Gln Thr Ile Val Pro Val Cys Ser
50 55 60

TAT GAA GAG AGG GAG AAG CCC AAC TGT TTG AAT TTG CAG GAC TCC TGC 238
Tyr Glu Glu Arg Glu Lys Pro Asn Cys Leu Asn Leu Gln Asp Ser Cys
65 70 75

AAG ACG AAT TAC ATC TGC AGA TCT CGC CTT GCG GAT TTT TTT ACC AAC 286
Lys Thr Asn Tyr Ile Cys Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn
80 85 90 95

TGC CAG CCA GAG TCA AGG TCT GTC AGC AGC TGT CTA AAG GAA AAC TAC 334
Cys Gln Pro Glu Ser Arg Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr
100 105 110

GCT GAC TGC CTC CTC GCC TAC TCG GGG CTT ATT GGC ACA GTC ATG ACC 382
Ala Asp Cys Leu Leu Ala Tyr Ser Gly Leu Ile Gly Thr Val Met Thr
115 120 125

CCC AAC TAC ATA GAC TCC AGT AGC CTC AGT GTG GCC CCA TGG TGT GAC 430
Pro Asn Tyr Ile Asp Ser Ser Ser Leu Ser Val Ala Pro Trp Cys Asp
130 135 140

TGC AGC AAC AGT GGG AAC GAC CTA GAA GAG TGC TTG AAA TTT TTG AAT 478
Cys Ser Asn Ser Gly Asn Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn
145 150 155

TTC TTC AAG GAC AAT ACA TGT CTT AAA AAT GCA ATT CAA GCC TTT GGC 526
Phe Phe Lys Asp Asn Thr Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly
160 165 170 175

AAT GGC TCC GAT GTG ACC GTG TGG CAG CCA GCC TTC CCA GTA CAG ACC 574
Asn Gly Ser Asp Val Thr Val Trp Gln Pro Ala Phe Pro Val Gln Thr
180 185 190

ACC ACT GCC GCT ACC ACC ACT GCC CTC CGG GTT AAG AAC AAG CCC CTG 622
Thr Thr Ala Ala Thr Thr Thr Ala Leu Arg Val Lys Asn Lys Pro Leu
195 200 205

GGG CCA GCA GGG TCT GAG AAT GAA ATT CCC ACT CAT GTT TTG CCA CCG 670
Gly Pro Ala Gly Ser Glu Asn Glu Ile Pro Thr His Val Leu Pro Pro
210 215 220

TGT GCA AAT TTA CAG GCA CAG AAG CTG AA 699
Cys Ala Asn Leu Gln Ala Gln Lys Leu
225 230

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ser Ala Tyr Ile Thr Pro Cys Thr Thr Ser Val Ser Asn Asp Val Cys
1 5 10 15

Asn Arg Arg Lys Cys His Lys Ala Leu Arg Gln Phe Phe Asp Lys Val
20 25 30

Pro Ala Lys His Ser Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp Ile
35 40 45

Ala Cys Thr Glu Arg Arg Arg Gln Thr Ile Val Pro Val Cys Ser Tyr
50 55 60

Glu Glu Arg Glu Lys Pro Asn Cys Leu Asn Leu Gln Asp Ser Cys Lys
65 70 75 80

Thr Asn Tyr Ile Cys Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys
85 90 95

Gln Pro Glu Ser Arg Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr Ala
100 105 110

Asp Cys Leu Leu Ala Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro
115 120 125

Asn Tyr Ile Asp Ser Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys
130 135 140

Ser Asn Ser Gly Asn Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe
145 150 155 160

Phe Lys Asp Asn Thr Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn
165 170 175

Gly Ser Asp Val Thr Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr
180 185 190

Thr Ala Ala Thr Thr Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly
195 200 205

Pro Ala Gly Ser Glu Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys
210 215 220

Ala Asn Leu Gln Ala Gln Lys Leu
225 230

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..886

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..2157 

(D) OTHER INFORMATION: /note= "1 to 2157 is 814 to 2971 of 
Figure 5 29brc" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

G TCG GCG TAC ATC ACC CCG TGC ACC ACC AGC GTG TCC AAT GAT GTC 46
Ser Ala Tyr Ile Thr Pro Cys Thr Thr Ser Val Ser Asn Asp Val
1 5 10 15

TGC AAC CGC CGC AAG TGC CAC AAG GCC CTC CGG CAG TTC TTT GAC AAG 94
Cys Asn Arg Arg Lys Cys His Lys Ala Leu Arg Gln Phe Phe Asp Lys
20 25 30

GTC CCG GCC AAG CAC AGC TAC GGA ATG CTC TTC TGC TCC TGC CGG GAC 142
Val Pro Ala Lys His Ser Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp
35 40 45

ATC GCC TGC ACA GAG CGG AGG CGA CAG ACC ATC GTG CCT GTG TGC TCC 190
Ile Ala Cys Thr Glu Arg Arg Arg Gln Thr Ile Val Pro Val Cys Ser
50 55 60

TAT GAA GAG AGG GAG AAG CCC AAC TGT TTG AAT TTG CAG GAC TCC TGC 238
Tyr Glu Glu Arg Glu Lys Pro Asn Cys Leu Asn Leu Gln Asp Ser Cys
65 70 75

AAG ACG AAT TAC ATC TGC AGA TCT CGC CTT GCG GAT TTT TTT ACC AAC 286
Lys Thr Asn Tyr Ile Cys Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn
80 85 90 95

TGC CAG CCA GAG TCA AGG TCT GTC AGC AGC TGT CTA AAG GAA AAC TAC 334
Cys Gln Pro Glu Ser Arg Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr
100 105 110

GCT GAC TGC CTC CTC GCC TAC TCG GGG CTT ATT GGC ACA GTC ATG ACC 382
Ala Asp Cys Leu Leu Ala Tyr Ser Gly Leu Ile Gly Thr Val Met Thr
115 120 125

CCC AAC TAC ATA GAC TCC AGT AGC CTC AGT GTG GCC CCA TGG TGT GAC 430
Pro Asn Tyr Ile Asp Ser Ser Ser Leu Ser Val Ala Pro Trp Cys Asp
130 135 140

TGC AGC AAC AGT GGG AAC GAC CTA GAA GAG TGC TTG AAA TTT TTG AAT 478
Cys Ser Asn Ser Gly Asn Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn
145 150 155

TTC TTC AAG GAC AAT ACA TGT CTT AAA AAT GCA ATT CAA GCC TTT GGC 526
Phe Phe Lys Asp Asn Thr Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly
160 165 170 175

AAT GGC TCC GAT GTG ACC GTG TGG CAG CCA GCC TTC CCA GTA CAG ACC 574
Asn Gly Ser Asp Val Thr Val Trp Gln Pro Ala Phe Pro Val Gln Thr
180 185 190

ACC ACT GCC GCT ACC ACC ACT GCC CTC CGG GTT AAG AAC AAG CCC CTG 622
Thr Thr Ala Ala Thr Thr Thr Ala Leu Arg Val Lys Asn Lys Pro Leu
195 200 205

GGG CCA GCA GGG TCT GAG AAT GAA ATT CCC ACT CAT GTT TTG CCA CCG 670
Gly Pro Ala Gly Ser Glu Asn Glu Ile Pro Thr His Val Leu Pro Pro
210 215 220

TGT GCA AAT TTA CAG GCA CAG AAG CTG AAA TCC AAT GTG TCG GGC AAT 718
Cys Ala Asn Leu Gln Ala Gln Lys Leu Lys Ser Asn Val Ser Gly Asn
225 230 235

ACA CAC CTC TGT ATT TCC AAT GGT AAT TAT GAA AAA GAA GGT CTC GGT 766
Thr His Leu Cys Ile Ser Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly
240 245 250 255

GCT TCC AGC CAC ATA ACC ACA AAA TCA ATG GCT GCT CCT CCA AGC TGT 814
Ala Ser Ser His Ile Thr Thr Lys Ser Met Ala Ala Pro Pro Ser Cys
260 265 270

GGT CTG AGC CCA CTG CTG GTC CTG GTG GTA ACC GCT CTG TCC ACC CTA 862
Gly Leu Ser Pro Leu Leu Val Leu Val Val Thr Ala Leu Ser Thr Leu
275 280 285

TTA TCT TTA ACA GAA ACA TCA TAG CTGCATTAAA AAAATACAAT ATGGACATGT 916
Leu Ser Leu Thr Glu Thr Ser *
290 295

AAAAAGACAA AAACCAAGTT ATCTGTTTCC TGTTCTCTTG TATAGCTGAA ATTCCAGTTT 976 

AGGAGCTCAG TTGAGAAACA GTTCCATTCA ACTGGAACAT TTTTTTTTTT CCTTTTAAGA 1036 

AAGCTTCTTG TGATCCTTCG GGGCTTCTGT GAAAAACCTG ATGCAGTGCT CCATCCAAAC 1096 

TCAGAAGGCT TTGGGATATG CTGTATTTTA AAGGGACAGT TTGTAACTTG GGCTGTAAAG 1156 

CAAACTGGGG CTGTGTTTTC GATGATGATG ATCATCATGA TCATGATNNN NNNNNNNNNN 1216 

NNNNNNNNNN NNNNNNNNNN NNNNNNGATT TTAACAGTTT TACTTCTGGC CTTTCCTAGC 1276

TAGAGAAGGA GTTAATATTT CTAAGGTAAC TCCCATATCT CCTTTAATGA CATTGATTTC 1336

TAATGATATA AATTTCAGCC TACATTGATG CCAAGCTTTT TTGCCACAAA GAAGATTCTT 1396

ACCAAGAGTG GGCTTTGTGG AAACAGCTGG TACTGATGTT CACCTTTATA TATGTACTAG 1456

CATTTTCCAC GCTGATGTTT ATGTACTGTA AACAGTTCTG CACTCTTGTA CAAAAGAAAA 1516 

AACACCTGTC ACATCCAAAT ATAGTATCTG TCTTTTCGTC AAAATAGAGA GTGGGGAATG 1576 

AGTGTGCCGA TTCAATACCT CAATCCCTGA ACGACACTCT CCTAATCCTA AGCCTTACCT 1636 

GAGTGAGAAG CCCTTTACCT AACAAAAGTC CAATATAGCT GAAATGTCGC TCTAATACTC 1696 

TTTACACATA TGAGGTTATA TGTAGAAAAA AATTTTACTA CTAAATGATT TCAACTATTG 1756 

GCTTTCTATA TTTTGAAAGT AATGATATTG TCTCATTTTT TTACTGATGG TTTAATACAA 1816 

AATACACAGA GCTTGTTTCC CCTCATAAGT AGTGTTCGCT CTGATATGAA CTTCACAAAT 1876 

ACAGCTCATC AAAAGCAGAC TCTGAGAAGC CTCGTGCTGT AGCAGAAAGT TCTGCATCAT 1936 

GTGACTGTGG ACAGGCAGGA GGAAACAGAA CAGACAAGCA TTGTCTTTTG TCATTGCTCG 1996 

AAGTGCAAGC GTGCATACCT GTGGAGGGAA CTGGTGGCTG CTTGTAAATG TTCTGCAGCA 2056 

TCTCTTGACA CACTTGTCAT GACACAATCC AGTACCTTGG TTTTCAGGTT ATCTGACAAA 2116 

GGCAGCTTTG ATTGGGACAT GGAGGCATGG GCAGGCCGGA A 2157 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ser Ala Tyr Ile Thr Pro Cys Thr Thr Ser Val Ser Asn Asp Val Cys
1 5 10 15

Asn Arg Arg Lys Cys His Lys Ala Leu Arg Gln Phe Phe Asp Lys Val
20 25 30

Pro Ala Lys His Ser Tyr Gly Met Leu Phe Cys Ser Cys Arg Asp Ile
35 40 45

Ala Cys Thr Glu Arg Arg Arg Gln Thr Ile Val Pro Val Cys Ser Tyr
50 55 60

Glu Glu Arg Glu Lys Pro Asn Cys Leu Asn Leu Gln Asp Ser Cys Lys
65 70 75 80

Thr Asn Tyr Ile Cys Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys
85 90 95

Gln Pro Glu Ser Arg Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr Ala
100 105 110

Asp Cys Leu Leu Ala Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro
115 120 125

Asn Tyr Ile Asp Ser Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys
130 135 140

Ser Asn Ser Gly Asn Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe
145 150 155 160

Phe Lys Asp Asn Thr Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn
165 170 175

Gly Ser Asp Val Thr Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr
180 185 190

Thr Ala Ala Thr Thr Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly
195 200 205

Pro Ala Gly Ser Glu Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys
210 215 220

Ala Asn Leu Gln Ala Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr
225 230 235 240

His Leu Cys Ile Ser Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala
245 250 255

Ser Ser His Ile Thr Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly
260 265 270

Leu Ser Pro Leu Leu Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu 
275 280 285 



Ser Leu Thr Glu Thr Ser * 
290 295 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 659 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..658

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..659

(D) OTHER INFORMATION: /note= "1 to 659 is 1033 to 1691 of
Figure 5 Hsgr-21ar"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

G AAT TTG CAG GAC TCC TGC AAG ACG AAT TAC ATC TGC AGA TCT CGC 46
Asn Leu Gln Asp Ser Cys Lys Thr Asn Tyr Ile Cys Arg Ser Arg
1 5 10 15

CTT GCG GAT TTT TTT ACC AAC TGC CAG CCA GAG TCA AGG TCT GTC AGC 94
Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu Ser Arg Ser Val Ser
20 25 30

AGC TGT CTA AAG GAA AAC TAC GCT GAC TGC CTC CTC GCC TAC TCG GGG 142
Ser Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu Leu Ala Tyr Ser Gly
35 40 45

CTT ATT GGC ACA GTC ATG ACC CCC AAC TAC ATA GAC TCC AGT AGC CTC 190
Leu Ile Gly Thr Val Met Thr Pro Asn Tyr Ile Asp Ser Ser Ser Leu
50 55 60

AGT GTG GCC CCA TGG TGT GAC TGC AGC AAC AGT GGG AAC GAC CTA GAA 238
Ser Val Ala Pro Trp Cys Asp Cys Ser Asn Ser Gly Asn Asp Leu Glu
65 70 75

GAG TGC TTG AAA TTT TTG AAT TTC TTC AAG GAC AAT ACA TGT CTT AAA 286
Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr Cys Leu Lys
80 85 90 95

AAT GCA ATT CAA GCC TTT GGC AAT GGC TCC GAT GTG ACC GTG TGG CAG 334
Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr Val Trp Gln
100 105 110

CCA GCC TTC CCA GTA CAG ACC ACC ACT GCC ACT ACC ACC ACT GCC CTC 382
Pro Ala Phe Pro Val Gln Thr Thr Thr Ala Thr Thr Thr Thr Ala Leu
115 120 125

CGG GTT AAG AAC AAG CCC CTG GGG CCA GCA GGG TCT GAG AAT GAA ATT 430
Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu Asn Glu Ile
130 135 140

CCC ACT CAT GTT TTG CCA CCG TGT GCA AAT TTA CAG GCA CAG AAG CTG 478
Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala Gln Lys Leu
145 150 155

AAA TCC AAT GTG TCG GGC AAT ACA CAC CTC TGT ATT TCC AAT GGT AAT 526
Lys Ser Asn Val Ser Gly Asn Thr His Leu Cys Ile Ser Asn Gly Asn
160 165 170 175






TAT GAA AAA GAA GGT CTC GGT GCT TCC AGC CAC ATA ACC ACA AAA TCA 
Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser His lie Thr Thr Lys Ser 
180 185 190 



574 



ATG GCT GCT CCT CCA AGC TGT GGT CTG AGC CCA CTG CTG GTC CTG GTG 
Met Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro Leu Leu Val Leu Val 



622 



GTA ACC GCT CTG TCC ACC CTA TTA TCT TTA ACA GAA A 
Val Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr Glu 
210 215 



659 
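The pairing of codons and residues shown for SEQ ID NO: 17 can be reproduced with a short script. The following is a minimal sketch, assuming the standard genetic code and a CDS whose start position is given in 1-based coordinates as in the FEATURE block; the function name `translate_cds` and the constant names are illustrative and do not come from the listing.

```python
# Standard genetic code, built from the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter residue names, as used in the sequence listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "*",
}


def translate_cds(dna, cds_start, cds_end):
    """Translate positions cds_start..cds_end (1-based, inclusive) of dna.

    Whitespace in the input is ignored, so a line can be pasted in the
    grouped form used by the listing.
    """
    seq = "".join(dna.split()).upper()
    coding = seq[cds_start - 1:cds_end]
    if len(coding) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return [
        THREE_LETTER[CODON_TABLE[coding[i:i + 3]]]
        for i in range(0, len(coding), 3)
    ]


# First line of SEQ ID NO: 17 (nucleotides 1 to 46); the CDS starts at 2.
first_line = "G AAT TTG CAG GAC TCC TGC AAG ACG AAT TAC ATC TGC AGA TCT CGC"
residues = translate_cds(first_line, 2, 46)
print(" ".join(residues))
```

Because the fragment begins one nucleotide before the first complete codon, passing a start of 2 is what keeps the translation in the frame shown in the listing.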



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Asn Leu Gln Asp Ser Cys Lys Thr Asn Tyr Ile Cys Arg Ser Arg Leu
1 5 10 15

Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu Ser Arg Ser Val Ser Ser
20 25 30

Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu Leu Ala Tyr Ser Gly Leu
35 40 45

Ile Gly Thr Val Met Thr Pro Asn Tyr Ile Asp Ser Ser Ser Leu Ser
50 55 60

Val Ala Pro Trp Cys Asp Cys Ser Asn Ser Gly Asn Asp Leu Glu Glu
65 70 75 80

Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr Cys Leu Lys Asn
85 90 95

Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr Val Trp Gln Pro
100 105 110

Ala Phe Pro Val Gln Thr Thr Thr Ala Thr Thr Thr Thr Ala Leu Arg
115 120 125

Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu Asn Glu Ile Pro
130 135 140

Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala Gln Lys Leu Lys
145 150 155 160

Ser Asn Val Ser Gly Asn Thr His Leu Cys Ile Ser Asn Gly Asn Tyr
165 170 175

Glu Lys Glu Gly Leu Gly Ala Ser Ser His Ile Thr Thr Lys Ser Met
180 185 190

Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro Leu Leu Val Leu Val Val
195 200 205

Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr Glu
210 215



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..629

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..630

(D) OTHER INFORMATION: /note= "1 to 630 is 1062 to 1691 of
Figure 5 Hsgr-21br"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

AC ATC TGC AGA TCT CGC CTT GCG GAT TTT TTT ACC AAC TGC CAG CCA 47
Ile Cys Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro
1 5 10 15

GAG TCA AGG TCT GTC AGC AGC TGT CTA AAG GAA AAC TAC GCT GAC TGC 95
Glu Ser Arg Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr Ala Asp Cys
20 25 30

CTC CTC GCC TAC TCG GGG CTT ATT GGC ACA GTC ATG ACC CCC AAC TAC 143
Leu Leu Ala Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro Asn Tyr
35 40 45

ATA GAC TCC AGT AGC CTC AGT GTG GCC CCA TGG TGT GAC TGC AGC AAC 191
Ile Asp Ser Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys Ser Asn
50 55 60

AGT GGG AAC GAC CTA GAA GAG TGC TTG AAA TTT TTG AAT TTC TTC AAG 239
Ser Gly Asn Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys
65 70 75

GAC AAT ACA TGT CTT AAA AAT GCA ATT CAA GCC TTT GGC AAT GGC TCC 287
Asp Asn Thr Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser
80 85 90 95

GAT GTG ACC GTG TGG CAG CCA GCC TTC CCA GTA CAG ACC ACC ACT GCC 335
Asp Val Thr Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr Thr Ala
100 105 110

ACT ACC ACC ACT GCC CTC CGG GTT AAG AAC AAG CCC CTG GGG CCA GCA 383
Thr Thr Thr Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly Pro Ala
115 120 125

GGG TCT GAG AAT GAA ATT CCC ACT CAT GTT TTG CCA CCG TGT GCA AAT 431
Gly Ser Glu Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn
130 135 140

TTA CAG GCA CAG AAG CTG AAA TCC AAT GTG TCG GGC AAT ACA CAC CTC 479
Leu Gln Ala Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr His Leu
145 150 155

TGT ATT TCC AAT GGT AAT TAT GAA AAA GAA GGT CTC GGT GCT TCC AGC 527
Cys Ile Ser Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser
160 165 170 175

CAC ATA ACC ACA AAA TCA ATG GCT GCT CCT CCA AGC TGT GGT CTG AGC 575
His Ile Thr Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly Leu Ser
180 185 190

CCA CTG CTG GTC CTG GTG GTA ACC GCT CTG TCC ACC CTA TTA TCT TTA 623
Pro Leu Leu Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu Ser Leu
195 200 205

ACA GAA A 630
Thr Glu



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Ile Cys Arg Ser Arg Leu Ala Asp Phe Phe Thr Asn Cys Gln Pro Glu
1 5 10 15

Ser Arg Ser Val Ser Ser Cys Leu Lys Glu Asn Tyr Ala Asp Cys Leu
20 25 30

Leu Ala Tyr Ser Gly Leu Ile Gly Thr Val Met Thr Pro Asn Tyr Ile
35 40 45

Asp Ser Ser Ser Leu Ser Val Ala Pro Trp Cys Asp Cys Ser Asn Ser
50 55 60

Gly Asn Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp
65 70 75 80

Asn Thr Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp
85 90 95

Val Thr Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr Thr Ala Thr
100 105 110

Thr Thr Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly
115 120 125

Ser Glu Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu
130 135 140

Gln Ala Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr His Leu Cys
145 150 155 160

Ile Ser Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser His
165 170 175

Ile Thr Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro
180 185 190

Leu Leu Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr
195 200 205

Glu



(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1075 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2..445

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..1075

(D) OTHER INFORMATION: /note= "1 to 1075 is 1255 to 2330
of Figure 5 Hsgr-2"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:



T GGG AAC GAC CTA GAA GAG TGC TTG AAA TTT TTG AAT TTC TTC AAG 46
Gly Asn Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys
1 5 10 15

GAC AAT ACA TGT CTT AAA AAT GCA ATT CAA GCC TTT GGC AAT GGC TCC 94
Asp Asn Thr Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser
20 25 30

GAT GTG ACC GTG TGG CAG CCA GCC TTC CCA GTA CAG ACC ACC ACT GCC 142
Asp Val Thr Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr Thr Ala
35 40 45

ACT ACC ACC ACT GCC CTC CGG GTT AAG AAC AAG CCC CTG GGG CCA GCA 190
Thr Thr Thr Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly Pro Ala
50 55 60

GGG TCT GAG AAT GAA ATT CCC ACT CAT GTT TTG CCA CCG TGT GCA AAT 238
Gly Ser Glu Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn
65 70 75

TTA CAG GCA CAG AAG CTG AAA TCC AAT GTG TCG GGC AAT ACA CAC CTC 286
Leu Gln Ala Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr His Leu
80 85 90 95

TGT ATT TCC AAT GGT AAT TAT GAA AAA GAA GGT CTC GGT GCT TCC AGC 334
Cys Ile Ser Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser
100 105 110

CAC ATA ACC ACA AAA TCA ATG GCT GCT CCT CCA AGC TGT GGT CTG AGC 382
His Ile Thr Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly Leu Ser
115 120 125

CCA CTG CTG GTC CTG GTG GTA ACC GCT CTG TCC ACC CTA TTA TCT TTA 430
Pro Leu Leu Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu Ser Leu
130 135 140

ACA GAA ACA TCA TAG CTGCATTAAA AAAATACAAT ATGGACATGT AAAAAGACAA 485
Thr Glu Thr Ser *
145

AAACCAAGTT ATCTGTTTCC TGTTCTCTTG TATAGCTGAA ATTCCAGTTT AGGAGCTCAG 545 

TTGAGAAACA GTTCCATTCA ACTGGAACAT TTTTTTTTTT CCTTTTAAGA AAGCTTCTTG 605 

TGATCCTTCG GGGCTTCTGT GAAAAACCTG ATGCAGTGCT CCATCCAAAC TCAGAAGGCT 665 

TTGGGATATG CTGTATTTTA AAGGGACAGT TTGTAACTTG GGCTGTAAAG CAAACTGGGG 725 

CTGTGTTTTC GATGATGATG ATCATCATGA TCATGATNNN NNNNNNNNNN NNNNNNNNNN 785 

NNNNNNNNNN NNNNNNGATT TTAACAGTTT TACTTCTGGC CTTTCCTAGC TAGAGAAGGA 845 

GTTAATATTT CTAAGGTAAC TCCCATATCT CCTTTAATGA CATTGATTTC TAATGATATA 905 
AATTTCAGCC TACATTGATG CCAAGCTTTT TTGCCACAAA GAAGATTCTT ACCAAGAGTG 965

GGCTTTGTGG AAACAGCTGG TACTGATGTT CACCTTTATA TATGTACTAG CATTTTCCAC 1025 
GCTGATGTTT ATGTACTGTA AACAGTTCTG CACTCTTGTA CAAAAGAAAA 1075 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Gly Asn Asp Leu Glu Glu Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp
1 5 10 15

Asn Thr Cys Leu Lys Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp
20 25 30

Val Thr Val Trp Gln Pro Ala Phe Pro Val Gln Thr Thr Thr Ala Thr
35 40 45

Thr Thr Thr Ala Leu Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly
50 55 60

Ser Glu Asn Glu Ile Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu
65 70 75 80

Gln Ala Gln Lys Leu Lys Ser Asn Val Ser Gly Asn Thr His Leu Cys
85 90 95

Ile Ser Asn Gly Asn Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser His
100 105 110

Ile Thr Thr Lys Ser Met Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro
115 120 125

Leu Leu Val Leu Val Val Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr
130 135 140

Glu Thr Ser *
145
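The /note fields of SEQ ID NOs: 17, 19 and 21 place each fragment on the coordinates of Figure 5. The arithmetic behind those notes can be checked with a few lines of code. This is only a sketch: it assumes a gap-free, one-to-one offset between fragment and figure positions, which the listing does not state, and the helper names are illustrative.

```python
# (SEQ ID NO, stated fragment length, figure start, figure end),
# copied from the /note fields of the listing.
NOTES = [
    (17, 659, 1033, 1691),
    (19, 630, 1062, 1691),
    (21, 1075, 1255, 2330),
]


def figure_position(fragment_pos, figure_start):
    """Map a 1-based fragment position onto Figure 5 coordinates,
    assuming the fragment aligns to the figure without gaps."""
    return figure_start + fragment_pos - 1


def span_length(start, end):
    """Number of positions in an inclusive coordinate range."""
    return end - start + 1


for seq_id, length, fig_start, fig_end in NOTES:
    mapped_end = figure_position(length, fig_start)
    print(
        f"SEQ ID NO: {seq_id}: last base maps to {mapped_end}, "
        f"note says {fig_end}, figure span covers "
        f"{span_length(fig_start, fig_end)} positions"
    )
```

Under the gap-free assumption the notes for SEQ ID NOs: 17 and 19 agree with the stated lengths, while the span given for SEQ ID NO: 21 covers one position more than its stated 1075 base pairs. The listing does not explain the difference.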



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..428

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..1059

(D) OTHER INFORMATION: /note= "1 to 1059 is 1272 to 2330
of Figure 5 Hsgr-9"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

AG TGC TTG AAA TTT TTG AAT TTC TTC AAG GAC AAT ACA TGT CTT AAA 47
Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr Cys Leu Lys
1 5 10 15

AAT GCA ATT CAA GCC TTT GGC AAT GGC TCC GAT GTG ACC GTG TGG CAG 95
Asn Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr Val Trp Gln
20 25 30

CCA GCC TTC CCA GTA CAG ACC ACC ACT GCC ACT ACC ACC ACT GCC CTC 143
Pro Ala Phe Pro Val Gln Thr Thr Thr Ala Thr Thr Thr Thr Ala Leu
35 40 45

CGG GTT AAG AAC AAG CCC CTG GGG CCA GCA GGG TCT GAG AAT GAA ATT 191
Arg Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu Asn Glu Ile
50 55 60

CCC ACT CAT GTT TTG CCA CCG TGT GCA AAT TTA CAG GCA CAG AAG CTG 239
Pro Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala Gln Lys Leu
65 70 75

AAA TCC AAT GTG TCG GGC AAT ACA CAC CTC TGT ATT TCC AAT GGT AAT 287
Lys Ser Asn Val Ser Gly Asn Thr His Leu Cys Ile Ser Asn Gly Asn
80 85 90 95

TAT GAA AAA GAA GGT CTC GGT GCT TCC AGC CAC ATA ACC ACA AAA TCA 335
Tyr Glu Lys Glu Gly Leu Gly Ala Ser Ser His Ile Thr Thr Lys Ser
100 105 110

ATG GCT GCT CCT CCA AGC TGT GGT CTG AGC CCA CTG CTG GTC CTG GTG 383
Met Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro Leu Leu Val Leu Val
115 120 125

GTA ACC GCT CTG TCC ACC CTA TTA TCT TTA ACA GAA ACA TCA TAG 428
Val Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr Glu Thr Ser *
130 135 140

CTGCATTAAA AAAATACAAT ATGGACATGT AAAAAGACAA AAACCAAGTT ATCTGTTTCC 488 

TGTTCTCTTG TATAGCTGAA ATTCCAGTTT AGGAGCTCAG TTGAGAAACA GTTCCATTCA 548



ACTGGAACAT TTTTTTTTTT TCCTTTTAAG AAAGCTTCTT GTGATCCTTT GGGGCTTCTG 608 
TGAAAAACCT GATGCAGTGC TCCATCCAAA CTCAGAAGGC TTTGGGATAT GCTGTATTTT 668

AAAGGGACAG TTTGTAACTT GGGCTGTAAA GCAAACTGGG GCTGTGTTTT CGATGATGAT 728

GATGATCATG ATGATGATCA TCATGATCAT GATGATGATC ATCATGATCA TGATGATGAT 788

TTTAACAGTT TTACTTCTGG CCTTTCCTAG CTAGAGAAGG AGTTAATATT TCTAAGGTAA 848

CTCCCATATC TCCTTTAATG ACATTGATTT CTAATGATAT AAATTTCAGC CTACATTGAT 908

GCCAAGCTTT TTTGCCACAA AGAAGATTCT TACCAAGAGT GGGCTTTGTG GAAACAGCTG 968

GTACTGATGT TCACCTTTAT ATATGTACTA GCATTTTCCA CGCTGATGTT TATGTACTGT 1028

AAACAGTTCT GCACTCTTGT ACAAAAGAAA A 1059

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 142 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Cys Leu Lys Phe Leu Asn Phe Phe Lys Asp Asn Thr Cys Leu Lys Asn
1 5 10 15

Ala Ile Gln Ala Phe Gly Asn Gly Ser Asp Val Thr Val Trp Gln Pro
20 25 30

Ala Phe Pro Val Gln Thr Thr Thr Ala Thr Thr Thr Thr Ala Leu Arg
35 40 45

Val Lys Asn Lys Pro Leu Gly Pro Ala Gly Ser Glu Asn Glu Ile Pro
50 55 60

Thr His Val Leu Pro Pro Cys Ala Asn Leu Gln Ala Gln Lys Leu Lys
65 70 75 80

Ser Asn Val Ser Gly Asn Thr His Leu Cys Ile Ser Asn Gly Asn Tyr
85 90 95

Glu Lys Glu Gly Leu Gly Ala Ser Ser His Ile Thr Thr Lys Ser Met
100 105 110

Ala Ala Pro Pro Ser Cys Gly Leu Ser Pro Leu Leu Val Leu Val Val
115 120 125

Thr Ala Leu Ser Thr Leu Leu Ser Leu Thr Glu Thr Ser
130 135 140
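The CDS locations of the nucleotide listings and the stated lengths of the paired protein listings (SEQ ID NOs: 17/18, 19/20, 21/22 and 23/24) can be cross-checked by simple arithmetic. The sketch below assumes that the CDS location of SEQ ID NO: 17 reads 2..658, which is the only value consistent with 219 residues in a 659-base fragment; the helper name `codon_count` is illustrative.

```python
# (nucleotide SEQ ID NO, CDS start, CDS end, protein SEQ ID NO, stated length)
# The 2..658 location for SEQ ID NO: 17 is an assumption (see lead-in).
LISTINGS = [
    (17, 2, 658, 18, 219),
    (19, 3, 629, 20, 209),
    (21, 2, 445, 22, 148),
    (23, 3, 428, 24, 142),
]


def codon_count(cds_start, cds_end):
    """Number of codons in an inclusive, 1-based CDS location."""
    n_bases = cds_end - cds_start + 1
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return n_bases // 3


for nt_id, start, end, aa_id, stated in LISTINGS:
    codons = codon_count(start, end)
    status = "matches" if codons == stated else "differs from"
    print(
        f"SEQ ID NO: {nt_id} CDS {start}..{end} has {codons} codons, "
        f"which {status} the {stated} residues stated for SEQ ID NO: {aa_id}"
    )
```

For SEQ ID NOs: 21 and 23 the codon count includes the terminal TAG stop codon, which the listings show as an asterisk.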
(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Gln Ser Cys Ser Thr Lys Tyr Arg Thr Leu
1 5 10



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Cys Lys Arg Gly Met Lys Lys Glu Lys Asn
1 5 10



(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Leu Leu Glu Asp Ser Pro Tyr Glu Pro Val 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Cys Ser Tyr Glu Glu Arg Glu Arg Pro Asn
1 5 10



(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Pro Ala Pro Pro Val Gln Thr Thr Thr Ala Thr Thr Thr Thr
1 5 10



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CTGTTTGAAT TTGCAGGACT C 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



CTCCTCTCTA AGCTTCTAAC CACAGCTTGG AGGAGC 
(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

CTCCTCTCTA AGCTTCTATG GGCTCAGACC ACAGCTT 37



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTCCTCTCTA AGCTTCTACT TGTCATCGTC GTCCTTGTAG TCACCACAGC TTGGAGGAGC 60 



(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
CTCCTCTCTA AGCTTCTACT TGTCATCGTC GTCCTTGTAG TCTGGCTCAG ACCACAGCTT 60 
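The oligonucleotides of SEQ ID NOs: 31 to 34 share a common 5' portion, which suggests that they are antisense primers. Taking the reverse complement makes their sense-strand content easier to read. The sketch below does this for SEQ ID NO: 33 and reports where three motifs fall. Reading those motifs as a HindIII recognition site (AAGCTT), a TAG stop codon and a sequence encoding the FLAG epitope (Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys) is an interpretation offered here as an assumption; this part of the listing does not state the purpose of the oligonucleotides.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string, ignoring whitespace."""
    cleaned = "".join(seq.split()).upper()
    return cleaned.translate(COMPLEMENT)[::-1]


# SEQ ID NO: 33 as printed in the listing (60 bases, 5' to 3').
SEQ_ID_33 = (
    "CTCCTCTCTA AGCTTCTACT TGTCATCGTC "
    "GTCCTTGTAG TCACCACAGC TTGGAGGAGC"
)

# Motifs searched for on the sense strand (labels are interpretive).
MOTIFS = {
    "FLAG-encoding sequence": "GACTACAAGGACGACGATGACAAG",
    "TAG stop codon followed by HindIII site": "TAGAAGCTT",
}

sense = reverse_complement(SEQ_ID_33)
print(sense)
for label, motif in MOTIFS.items():
    # str.find returns a 0-based index, or -1 when the motif is absent.
    print(f"{label}: 0-based index {sense.find(motif)}")
```

On the sense strand, the first 18 bases read GCT CCT CCA AGC TGT GGT, which corresponds to the Ala Pro Pro Ser Cys Gly stretch shown near the end of the coding sequences above.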



CLAIMS

What is claimed is:

1. An isolated and purified protein comprising an amino acid sequence as depicted
in Figure 2 or 4 (SEQ ID NO: 2 or 4) and analogs thereof wherein the protein is
capable of complexing with glial cell line-derived neurotrophic factor (GDNF) and
thereby mediating cell response to GDNF.

2. A protein of Claim 1 comprising the amino acid sequence as depicted in
Figure 2 (SEQ ID NO: 2).

3. A protein of Claim 1 comprising the amino acid sequence as depicted in
Figure 4 (SEQ ID NO:4).

4. A protein of Claim 1 comprising the amino acid sequence Ser 18 through
Pro 446 as depicted in Figure 2 (SEQ ID NO:2).

5. A protein of Claim 1 comprising the amino acid sequence Asp 25 through
Leu 447 as depicted in Figure 2 (SEQ ID NO:2).

6. A protein of Claim 1 comprising the amino acid sequence Cys 29 through
Cys 442 as depicted in Figure 2 (SEQ ID NO:2).

7. A protein of Claim 1 comprising the amino acid sequence Ala 19 through
Val 450 as depicted in Figure 4 (SEQ ID NO:4).

8. A protein of Claim 1 comprising the amino acid sequence Cys 29 through
Cys 443 as depicted in Figure 4 (SEQ ID NO:4).

9. A protein of Claim 1 which is glycosylated.

10. A protein of Claim 1 which is non-glycosylated.

11. A protein of Claims 1 to 10 which is produced by recombinant technology or
chemical synthesis.

12. A pharmaceutical composition comprising a protein as claimed in any one of
claims 1 to 10 in combination with a pharmaceutically acceptable carrier.

13. An isolated nucleic acid sequence encoding a neurotrophic factor receptor
protein comprising an amino acid sequence as claimed in any one of claims 1 to 8.

14. An isolated nucleic acid sequence encoding a neurotrophic factor receptor
protein comprising an amino acid sequence as depicted in Figure 2 or 4 (SEQ ID NO:
2 or 4) and analogs thereof wherein the protein is capable of complexing with glial cell
line-derived neurotrophic factor (GDNF) and thereby mediating cell response to
GDNF.

15. A nucleic acid sequence of Claim 14 encoding a neurotrophic factor receptor
protein comprising the amino acid sequence as depicted in Figure 2 (SEQ ID NO: 2).

16. A nucleic acid sequence of Claim 14 encoding a neurotrophic factor receptor
protein comprising the amino acid sequence as depicted in Figure 4 (SEQ ID NO:4).



17. An isolated nucleic acid sequence comprising:

(a) a sequence set forth in Figure 1 (SEQ ID NO: 1) comprising nucleotides
encoding Met 1 through Ser 465 or Figure 3 (SEQ ID NO: 3) comprising
nucleotides encoding Met 1 through Ser 468 wherein said sequence encodes a
neurotrophic factor receptor protein (GDNFR) capable of complexing with
glial cell line-derived neurotrophic factor (GDNF) and thereby mediating cell
response to GDNF;

(b) a nucleic acid sequence which (1) hybridizes to a complementary sequence of
(a) and (2) encodes an amino acid sequence with GDNFR activity; and

(c) a nucleic acid sequence which (1) but for the degeneracy of the genetic code
would hybridize to a complementary sequence of (a) and (2) encodes an
amino acid sequence with GDNFR activity.



18. A vector comprising a nucleic acid sequence according to any of claims 14 
to 17 operatively linked to one or more operational elements capable of effecting the 
amplification or expression of said nucleic acid sequence. 

19. A vector comprising a nucleic acid sequence encoding a neurotrophic factor
receptor protein comprising the amino acid sequence as depicted in Figure 2 or 4
(SEQ ID NO: 2 or 4) operatively linked to one or more operational elements capable of
effecting the amplification or expression of said nucleic acid sequence.

20. A host cell transformed or transfected with the vector of claim 18.

21. A host cell transformed or transfected with the vector of claim 19.

22. A host cell of claim 20 selected from the group consisting of mammalian cells
and bacterial cells.

23. A host cell of claim 22 which is a COS-7 cell or E. coli.

24. A host cell of Claim 20 wherein said cell is suitable for human implantation and
wherein said cell expresses and secretes said neurotrophic factor receptor.

25. A host cell of Claim 21 wherein said cell is suitable for human implantation and
wherein said cell expresses and secretes said neurotrophic factor receptor.

26. A host cell of Claim 20 wherein said cell is transformed or transfected ex vivo.

27. A host cell of Claim 20 wherein said cell is enclosed in a semipermeable
membrane suitable for human implantation.

28. A method for the production of a neurotrophic factor receptor protein 
comprising the steps of: 

(a) culturing a host cell, containing a nucleic acid sequence encoding a 
neurotrophic factor receptor protein comprising an amino acid sequence as 
depicted in Figure 2 or 4 (SEQ ID NO: 2 or 4) and analogs thereof wherein the 
protein is capable of complexing with glial cell line-derived neurotrophic factor 
(GDNF) and thereby mediating cell response to GDNF, under conditions 
suitable for the expression of said neurotrophic factor receptor protein by said 
host cell; and 

(b) optionally, isolating said neurotrophic factor receptor protein expressed by said 
host cell. 



29. A method of claim 28, wherein said nucleic acid sequence encodes a
neurotrophic factor receptor protein comprising the amino acid sequence as depicted in
Figure 2 (SEQ ID NO:2).

30. A method of claim 28, wherein said nucleic acid sequence encodes a
neurotrophic factor receptor protein comprising the amino acid sequence as depicted in
Figure 4 (SEQ ID NO:4).

31. A method for the production of a neurotrophic factor receptor protein
comprising the steps of:

(a) culturing a host cell transformed or transfected with a nucleic acid sequence
according to claim 17 under conditions suitable for the expression of said
neurotrophic factor receptor protein by said host cell; and

(b) optionally, isolating said neurotrophic factor receptor protein expressed by
said host cell.

32. A method of claim 28 or 31, further comprising the step of refolding the
isolated neurotrophic factor receptor.

33. A method of claim 28 or 31, wherein said host cell is a prokaryotic cell.

34. A method of claim 28 or 31, wherein said host cell is a eukaryotic cell.

35. A substantially purified neurotrophic factor receptor protein prepared according
to the method of any of claims 28 to 31.

36. The use of the neurotrophic factor receptor protein of claim 1 for the
manufacture of a pharmaceutical composition.

37. A method of treating improperly functioning dopaminergic nerve cells by
administering a neurotrophic factor receptor protein of claim 1.

38. A method of treating Parkinson's disease by administering a neurotrophic
factor receptor protein of claim 1.

39. A method of treating Alzheimer's disease by administering a neurotrophic
factor receptor protein of claim 1.

40. A method of treating amyotrophic lateral sclerosis by administering a
neurotrophic protein of claim 1.

41. An antibody that binds to a neurotrophic factor receptor protein comprising an
amino acid sequence of SEQ ID NO:2 or SEQ ID NO:4.

42. The antibody of claim 41 wherein said antibody is a monoclonal antibody.

43. The antibody of claim 41 wherein said antibody is a polyclonal antibody.

44. An antibody produced by immunizing an animal with a neurotrophic factor
receptor protein comprising an amino acid sequence of SEQ ID NO:2 or SEQ ID
NO:4.

45. A hybridoma that produces a monoclonal antibody that binds to a neurotrophic
factor receptor protein comprising an amino acid sequence of SEQ ID NO:2 or SEQ ID
NO:4.

46. A device for treating nerve damage, comprising:

(a) a semipermeable membrane suitable for implantation; and
(b) cells encapsulated within said membrane, wherein said cells secrete a
neurotrophic factor receptor protein according to claim 1;
said membrane being permeable to the neurotrophic factor receptor protein and
impermeable to materials detrimental to said cells.

47. The device of claim 46, wherein said cells are naturally occurring cells that
secrete said neurotrophic factor receptor protein.

48. The device of claim 46, wherein said cells have been modified to secrete said
neurotrophic factor receptor protein by means of a nucleic acid sequence comprising:
(a) a sequence set forth in Figure 1 (SEQ ID NO: 1) comprising nucleotides
encoding Met 1 through Ser 465 or Figure 3 (SEQ ID NO: 3) comprising
nucleotides encoding Met 1 through Ser 468 encoding a neurotrophic factor
receptor protein (GDNFR) capable of complexing with glial cell line-derived
neurotrophic factor (GDNF) and mediating cell response to GDNF;
(b) a nucleic acid sequence which (1) hybridizes to a complementary sequence of
(a) and (2) encodes an amino acid sequence with GDNFR activity; and
(c) a nucleic acid sequence which (1) but for the degeneracy of the genetic code would
hybridize to a complementary sequence of (a) and (2) encodes an amino acid
sequence with GDNFR activity.

49. An assay device for analyzing a test sample for the presence of glial cell line-derived
neurotrophic factor, comprising: a solid phase containing or coated with a
GDNFR protein, wherein said GDNFR protein reacts with GDNF present in the test
sample and produces a detectable reaction product indicative of the presence of GDNF.

50. A method for analyzing a test sample for the presence of glial cell line-derived
neurotrophic factor, comprising: contacting the sample to an assay reagent comprising
GDNFR protein, wherein said GDNFR protein reacts with GDNF present in the test
sample and produces a detectable reaction product indicative of the presence of GDNF.
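Part (c) of claims 17 and 48 relies on the degeneracy of the genetic code: many different nucleotide sequences encode the same amino acid sequence. As an illustration only, the sketch below counts how many distinct coding sequences exist for a short peptide. It assumes the standard genetic code and uses the ten-residue peptide of SEQ ID NO: 25 as the example; the function names are illustrative.

```python
from math import prod

# Standard genetic code, built from the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def synonymous_codons(residue):
    """All codons that encode the given one-letter residue."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == residue)


def coding_sequence_count(peptide):
    """Number of distinct nucleotide sequences encoding the peptide."""
    return prod(len(synonymous_codons(r)) for r in peptide)


# SEQ ID NO: 25 (Gln Ser Cys Ser Thr Lys Tyr Arg Thr Leu) in one-letter code.
PEPTIDE = "QSCSTKYRTL"
print(synonymous_codons("Q"))
print(coding_sequence_count(PEPTIDE))
```

For this ten-residue peptide the count is 331,776, which shows why a sequence can encode the same protein without hybridizing to the complement of the sequence in part (a).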
FIG. 1



10 30 50 

AATCTGGCCTCGGAACACGCCATTCTCCGCGCCGCTTCCAATAACCACTAACATCCCTA 

70 90 110 

ACGAGCATCCGAGCCGAGGGCTCTGCTCGGAAATCGTCCTGGCCCAACTCGGCCCTTCGA 

130 150 170 

GCTCTCGAAGATTACCGCATCTATTTTTTTTTTCTTTTTTTTCTTTTCCTAGCGCAGATA 

190 210 230 

AAGTGAGCCCGGAAAGGGAAGGAGGGGGCGGGGACACCATTGCCCTGAAAGAATAAATAA 

250 270 290 

GTAAATAAACAAACTGGCTCCTCGCCGCAGCTGGACGCGGTCGGTTGAGTCCAGGTTGGG 

310 330 350 

TCGGACCTGAACCCCTAAAAGCGGAACCGCCTCCCGCCCTCGCCATCCCGGAGCTGAGTC 

370 390 410 

GCCGGCGGCGGTGGCTGCTGCCAGACCCGGAGTTTCCTCTTTCACTGGATGGAGCTGAAC 

430 450 470 

TTTGGGCGGCCAGAGCAGCACAGCTGTCCGGGGATCGCTGCACGCTGAGCTCCCTCGGCA 

490 510 530 

AGACCCAGCGGCGGCTCGGGATTTTTTTGGGGGGGCGGGGACCAGCCCCGCGCCGGCACC 

550 570 590 

ATGTTCCTGGCGACCCTGTACTTCGCGCTGCCGCTCTTGGACTTGCTCCTGTCGGCCGAA 
MFLATLYFALPLLDLLLSAE 

SUBSTITUTE SHEET (RULE 26) 



WO 97/401^^ PCT/US97/06281 



2/5 1 



< 

CD 



o 
in 



o 
m 
vo 



< 

u 
< 
u 

o 

u 
u 
u 

H 
O 
< 

u 

< 

a 

H 

a 
< 

u 
u 
o 



o 

a 
u 

o 

E-» 
< 

o 

H 
U 
U 
O 
U 
CJ 
< 

< 

o 
e> 
u 
u 
a 
u 
a 
< 
a 

H 



CO 



u 



a 



u 



to 



o 
cn 
vo 



o 



u 

u 

u 
u 
< 

< 

o 
o 

u 
o 
o 
o 
a 
a 
o 

H 
O 

u 
o 

H 
C 
< 
U 

a 
u 

U 

o 
u 
< 
u 
o 
u 
u 
< 

O 

u 
u 
< 
u 
a 
< 
u 
o 

H 
U 

o 
< 



CxJ 



O 



as 



CO 



to 



o 



o 
in 



o 



2 

o 

CJ 
U 

a 
a 
a 
< 

a 

H 

u 
u 
o 

CJ 

a 
< 

a 
a 
u 
u 
a 

o 
< 
o 

< 

o 
o 

CJ 

u 
o 
o 
< 
a 
o 

H 

CJ 
CJ 

e> 
e> 
u 
u 

H 

< 

u 
o 
o 

u 
u 
a 
< 



to 



CO 



a: 



CO 



o 
m 

CO 



CO 



o 



CO 



U 
CJ 
O 

a 

2 

< 

o 

2 
2 

o 

< 

O 
O 

O 

u 
a 

U 

o 

u 
o 
u 
u 
a 

u 

2 

cj 
< 

u 

H 
U 
O 
CJ 

a 

2 

< 



CO 



u 



u 



CO 



a 



o 

CO 



o 

CO 



c 

LI 
CD 



< 

u 
u 
u 
u 

H 
< 
O 

< 

o 
a 

H 

u 
a 

U 
< 



a 
o 
a 
< 
u 

CJ 

cj 
o 
< 

< 

CJ 

u 
< 

H 

a 

H 

< 

u 

< 

o 
u 

H 

u 
< 

H 
< 

o 
u 



CO 



to 



CO 



CO 



SUBSTITUTE SHEET (RULE 26) 



WO 97/40152 ^HCT7US97/06281 



3/5 1 



00 

6 
LL 



O 

in 



o 

ON 



< 

O 

< 

< 
CJ 
H 
£h 
< 
U 
U 
U 
H 

o 

a 
o 
a 
u 

H 

< 
< 
< 

u 

O 
< 

u 
a 
< 
u 

H 
H 
O 
< 

CJ 

u 

O 
H 
< 



to 



CO 



CO 





CJ 






U 












CJ 






U 












a 






< 












o 

^ 






CC 






TC 












< 


E- 4 






U- 




a 






u 






a 






u 






o 






< 






o 






H 


Cj 




u 


o 


o 


< 






O 






o 




rH 


u 






CJ 




ro 


o 




o 


o 


< 


o 


U 


a< 




a 


cc 


i— ( 


AT 




rH 


CC 






TC 






o 


Q 




< 






u 






a 






cj 






cj 


















CJ 






u 






< 


M 




O 


< 




u 












o 






u 






< 












H 


u 






















CG 






AC 








z 




a 


< 




CJ 












CG 






GC 








2 




H 


CO 




H 


u 










O 






a 






a 






O 






^ 




o 


o 


a 


o 


< 


PC 


o 












in 


u 




rH 


a 




C\ 






o 


< 




rH 


o 








u: 


rH 






rH 


u 


a: 




u 






O 






CJ 






cj 












o 






CJ 


cu 






U 




CJ 


a: 










o 






CJ 
























< 


M 












z 




CJ 






CJ 






CJ 












a 






o 






u 


X 






u 




H 


u 




o 






£h 






U 






3 






Eh 






H 








u 




< 


M 




C 


> 




a 






























5 






o 


> 




CG 


Q 




CG 


o 


o 






O 


< 




o 






r- 


u 


o 


ro 




Q 




5 




a\ 






O 


u 




o 


u 






< 




rH 


H 




rH 


CJ 






cj 


a 




(J 


iJ 




H 


CO 




H 






CJ 






O 


















H 






H 












O 


> 




H 






CJ 






CJ 












a 






a 






a 


> 






U 




< 


CO 



SUBSTITUTE SHEET (RULE 26) 



WO 97/40152    PCT/US97/06281

[Sheet 4/51: figure text not recoverable from the scan.]



[Sheet 5/51: figure text not recoverable from the scan.]



[Sheet 6/51: figure text not recoverable from the scan.]



[Sheet 7/51: figure text not recoverable from the scan.]



[Sheet 8/51: figure text not recoverable from the scan.]



9/51

FIG. 2 



MF LA ?? YFA LP LLD LLL SAE   20
VS GG DR LDC VK ASD QCL KEQ   40
SC ST KY RTL RQ CVA GKE TNF   60
SL AS GL EAK DE CRS AME ALK   80
QK SL YN CRC KR GMK KEK NCL  100
RI YW SM YQS LQ GND LLE DSP  120
VE PV NS RLS DI FRV VPF ISD  140
VF QQ VE HIP KG NNC LDA AKA  160
CN LD DI CKK VR SAY ITP CTT  180
SV SN DV CNR RK CHK ALR QFF  200
DK VP AK HSY GM LFC SCR DIA  220
CT ER RR QTI VP VCS YEE REK  240
PN CL NL QDS CK TNY ICR SRL  260
AD FF TN CQP ES RSV SSC LKE  280
NY AD CL LAY SG LIG TVM TPN  300
YI DS SS LSV AP WCD CSN SGN  320
DL EE CL KFL NF FKD NTC LKN  340
AI QA FG NGS DV TVW QPA F?V  360
QT TT AT TTT AL RVK NKP LG?  380
AG SE NE IPT HV LP? CAN LQA  400
QK LK SN YSG N? HL? ISN GNY  420
EK ?? ?? ASS HI T?V SMA A??  440
SC ?? ?? LLV LV VTA LST LLS  460
?? ?? ?                      465

[? marks a residue that is illegible or missing in the scan.]






10/51 

FIG. 3 

10 30 50

AGCTCGCTCTCCCGGGGCAGTGGTGTGGATGCACCGGAGTTCGGGCGCTGGGCAAGTTGG 

70 90 110

GTCGGAACTGAACCCCTGAAAGCGGGTCCGCCTCCCGCCCTCGCGCCCGCCCGGATCTGA 
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